tungwaiyip.info

home

about me

links

Blog

< June 2005 >
SuMoTuWeThFrSa
    1 2 3 4
5 6 7 8 91011
12131415161718
19202122232425
2627282930  

past articles »

Click for San Francisco, California Forecast

San Francisco, USA

 

Python's half open index notation

Beginner programmers often wonder about Python's sequence indexing and slicing notation. Array index starts from 0. Slicing uses half open notations, where L[a:b] is a subsequence with index x where a <= x < b.

Why is the endpoint excluded? Isn't it more intuitive if array index starts from 1 and the endpoint is included, so that a 3 elements array is referenced as L[1:3] with items L[1], L[2], L[3]?

It turns out this notation is an elegant and deliberate design and it has some excellent properties.

We write programs to operate on arrays, to find their length, traverse the subsequences, split them or join them. The half open notation always show a simple pattern. But the inclusive notation often requires adding 1 or substracting 1 to the indexes in many operations. Thus it is more vulnerable to off-by-one-error. This article One True Way of array indexing discuss this at length. I have reproduce its example (with corrections) below:

Operation Half open Inclusive
length of a slice L[a:b] b-a (b-a+1)
first n characters of L L[:n] L[1:n]
last n characters of L L[-n:] L[len(L)-n+1:]
The identity slice L == L[0:len(L)] == L[:] L[1:len(L)]
The empty slice L[a:a] is empty for any a. perhaps L[a:a-1]?
A slice of length n, from point a L[a:a+n] L[a:a+n-1]
Split L[a:b] at index c L[a:b] == L[a:c]+L[c:b] L[a:c-1]+L[c:b]

Another important property is an empty sequence can be expressed by L[a:a], while there is no natural way to express an empty sequence with the inclusive notation. But do we really need to care about a special case? Absolutely! In fact failure to account for empty input is one of the most common error. Just like zero is a fundamental concept in mathematics, always think how you program can handle null input. An inferior approach is to represent empty sequence by None or null pointer. This creates a special case so that a variable need to be tested before dereferencing. Failure to do so contributes to unexpected exceptions. It is an elegant design that L[a:b] can also represent sequences with 0 length.

C++'s STL also choose this notation to represent a range. According to the literature this is crucial because "algorithms that operate on n things frequently require n+1 positions. Linear search, for example (find) must be able to return some value to indicate that the search was unsuccessful." I have seen so many people flunked link list or data structure exercises because they have trouble dealing with the end of a list. Often a good solution is shift the focus beyond the n concrete objects to the n+1 positions around them. I hope this help to make sense of the half open notation.

2005.06.16 [, ] - comments

 

 

blog comments powered by Disqus

past articles »

 

BBC News

 

Raqqa: IS 'capital' falls to US-backed Syrian forces (17 Oct 2017)

 

Catalonia: Spain detains two separatists (17 Oct 2017)

 

The Hollywood women tackling sexual harassment (17 Oct 2017)

 

Malta journalist death: Caruana Galizia's son denounces "mafia state" (17 Oct 2017)

 

Portugal fires: Three days of national mourning for wildfire victims (17 Oct 2017)

 

Kirkuk: Iraqi forces seize largest oilfields near city (17 Oct 2017)

 

China congress: Why Beijing has banned hot air balloons (16 Oct 2017)

 

Philippine conflict: Duterte says Marawi is militant-free (17 Oct 2017)

 

'Anne Frank' children's costume sparks controversy (17 Oct 2017)

 

Tucson, Arizona, trailer fire 'caused by spider burning' (17 Oct 2017)

more »

 

SF Gate

 

Bay Area News (7 Jan 2012)

 

City Insider (11 Feb 2012)

 

Crime Scene (13 Feb 2012)

 

C.W Newius Column (10 Jan 2012)

 

C.W. Nevius Blog (11 Feb 2012)

 

Education News (10 Jan 2012)

 

KALW (11 Feb 2012)

 

Matier and Ross Blog (11 Feb 2012)

 

California man cited for flying drone over airport, impeding firefighters (16 Oct 2017)

 

Court to rule on forcing tech firms to provide data held abroad (16 Oct 2017)

 

Fed’s Yellen says the economy remains in good health (16 Oct 2017)

 

After the smoke clears: Wine Country economy expected to rebound (15 Oct 2017)

 

Updated: Where fire victims can apply for tax relief and FEMA grants and loans (14 Oct 2017)

 

Twitter CEO Jack Dorsey promises tougher stance on abusive tweets (14 Oct 2017)

more »

 


Site feed Updated: 2017-Oct-17 06:00