tungwaiyip.info

home

about me

links

Blog

< June 2005 >
SuMoTuWeThFrSa
    1 2 3 4
5 6 7 8 91011
12131415161718
19202122232425
2627282930  

past articles »

Click for San Francisco, California Forecast

San Francisco, USA

 

Python's half open index notation

Beginner programmers often wonder about Python's sequence indexing and slicing notation. Array index starts from 0. Slicing uses half open notations, where L[a:b] is a subsequence with index x where a <= x < b.

Why is the endpoint excluded? Isn't it more intuitive if array index starts from 1 and the endpoint is included, so that a 3 elements array is referenced as L[1:3] with items L[1], L[2], L[3]?

It turns out this notation is an elegant and deliberate design and it has some excellent properties.

We write programs to operate on arrays, to find their length, traverse the subsequences, split them or join them. The half open notation always show a simple pattern. But the inclusive notation often requires adding 1 or substracting 1 to the indexes in many operations. Thus it is more vulnerable to off-by-one-error. This article One True Way of array indexing discuss this at length. I have reproduce its example (with corrections) below:

Operation Half open Inclusive
length of a slice L[a:b] b-a (b-a+1)
first n characters of L L[:n] L[1:n]
last n characters of L L[-n:] L[len(L)-n+1:]
The identity slice L == L[0:len(L)] == L[:] L[1:len(L)]
The empty slice L[a:a] is empty for any a. perhaps L[a:a-1]?
A slice of length n, from point a L[a:a+n] L[a:a+n-1]
Split L[a:b] at index c L[a:b] == L[a:c]+L[c:b] L[a:c-1]+L[c:b]

Another important property is an empty sequence can be expressed by L[a:a], while there is no natural way to express an empty sequence with the inclusive notation. But do we really need to care about a special case? Absolutely! In fact failure to account for empty input is one of the most common error. Just like zero is a fundamental concept in mathematics, always think how you program can handle null input. An inferior approach is to represent empty sequence by None or null pointer. This creates a special case so that a variable need to be tested before dereferencing. Failure to do so contributes to unexpected exceptions. It is an elegant design that L[a:b] can also represent sequences with 0 length.

C++'s STL also choose this notation to represent a range. According to the literature this is crucial because "algorithms that operate on n things frequently require n+1 positions. Linear search, for example (find) must be able to return some value to indicate that the search was unsuccessful." I have seen so many people flunked link list or data structure exercises because they have trouble dealing with the end of a list. Often a good solution is shift the focus beyond the n concrete objects to the n+1 positions around them. I hope this help to make sense of the half open notation.

2005.06.16 [, ] - comments

 

 

blog comments powered by Disqus

past articles »

 

BBC News

 

Ex-FTX boss Bankman-Fried: 'I didn't try to commit fraud' (01 Dec 2022)

 

Cyril Ramaphosa: South African president faces threat of impeachment over 'Farmgate' (01 Dec 2022)

 

Christine McVie, Fleetwood Mac singer-songwriter, dies aged 79 (01 Dec 2022)

 

Lady Susan Hussey quits over remarks to charity boss Ngozi Fulani (30 Nov 2022)

 

Spectacular scenes as Hawaii volcano erupts (30 Nov 2022)

 

Danny Masterson: Jury fails to reach a verdict in rape case against That 70s Show actor (01 Dec 2022)

 

Canada: Ambassador tells EU that deforestation rules 'burdensome' (01 Dec 2022)

 

Idaho stabbings: Families deliver tributes for the four victims at vigil (01 Dec 2022)

 

San Francisco to allow police 'killer robots' (30 Nov 2022)

 

Musk says Twitter feud with Apple boss 'resolved' (01 Dec 2022)

more »

 

SF Gate

 

San Francisco food delivery giant DoorDash lays off 1,250 employees (30 Nov 2022)

 

20-year-old California student creates viral Spotify festival poster app (29 Nov 2022)

 

Elon Musk’s Twitter cuts COVID misinformation policy, guts child safety protections (29 Nov 2022)

 

Elon Musk’s Twitter is reportedly failing to pay some employees (28 Nov 2022)

 

Bay Area tech mainstay HP to lay off up to 6,000 people (26 Nov 2022)

 

Elon Musk signals his support for Ron DeSantis in 2024 on Twitter (25 Nov 2022)

more »


Site feed Updated: 2022-Nov-30 21:00