Python's half open index notation

Beginning programmers often wonder about Python's sequence indexing and slicing notation. Indexes start from 0. Slicing uses half open notation, where L[a:b] is the subsequence of items whose index x satisfies a <= x < b.
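
Here is a minimal sketch of that rule. The list `L` and the helper `half_open_slice` are made up for illustration; the helper only spells out what the built-in slice already does for in-range, non-negative indexes.

```python
# L is a hypothetical example list; any sequence behaves the same way.
L = ['a', 'b', 'c', 'd', 'e']

def half_open_slice(seq, a, b):
    """Collect the items whose index x satisfies a <= x < b.

    Illustration only: assumes 0 <= a <= b <= len(seq).
    """
    return [seq[x] for x in range(a, b)]

first_item = L[0]   # indexing starts from 0
middle = L[1:4]     # indexes 1, 2 and 3; index 4 is excluded
same = half_open_slice(L, 1, 4)
```

Both `middle` and `same` hold the three items at indexes 1, 2 and 3, which is b-a items.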

Why is the endpoint excluded? Wouldn't it be more intuitive if array indexes started from 1 and the endpoint were included, so that a 3-element array is referenced as L[1:3] with items L[1], L[2], L[3]?

It turns out this notation is an elegant and deliberate design and it has some excellent properties.

We write programs to operate on arrays: to find their length, traverse their subsequences, split them or join them. With the half open notation these operations always show a simple pattern. The inclusive notation, by contrast, requires adding 1 to or subtracting 1 from the indexes in many operations, so it is more vulnerable to off-by-one errors. The article "One True Way of array indexing" discusses this at length. I have reproduced its example (with corrections) below:

| Operation | Half open | Inclusive |
|---|---|---|
| Length of a slice L[a:b] | b-a | b-a+1 |
| First n characters of L | L[:n] | L[1:n] |
| Last n characters of L | L[-n:] | L[len(L)-n+1:] |
| The identity slice | L == L[0:len(L)] == L[:] | L[1:len(L)] |
| The empty slice | L[a:a] is empty for any a | perhaps L[a:a-1]? |
| A slice of length n, from point a | L[a:a+n] | L[a:a+n-1] |
| Split L[a:b] at index c | L[a:b] == L[a:c]+L[c:b] | L[a:c-1]+L[c:b] |
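
The half open column can be checked directly in Python. This is only a sketch: the sample list and the values of `a`, `b`, `c` and `n` are arbitrary choices for illustration, not part of the original example.

```python
L = list("abcdefgh")       # hypothetical sample data
a, c, b, n = 2, 4, 7, 3    # arbitrary points with a <= c <= b

# One entry per row of the table; each value should be True.
checks = {
    "length of a slice": len(L[a:b]) == b - a,
    "first n": L[:n] == ['a', 'b', 'c'],
    "last n": L[-n:] == ['f', 'g', 'h'],
    "identity slice": L == L[0:len(L)] == L[:],
    "empty slice": L[a:a] == [],
    "length n from a": len(L[a:a + n]) == n,
    "split at c": L[a:b] == L[a:c] + L[c:b],
}

all_hold = all(checks.values())
```

One caveat: `L[-n:]` returns the whole list when n is 0, because -0 is just 0. Writing `L[len(L)-n:]` avoids that edge case.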

Another important property is that an empty sequence can be expressed by L[a:a], while there is no natural way to express an empty sequence with the inclusive notation. But do we really need to care about a special case? Absolutely! In fact, failure to account for empty input is one of the most common errors. Just as zero is a fundamental concept in mathematics, always think about how your program handles empty input. An inferior approach is to represent an empty sequence by None or a null pointer. This creates a special case in which a variable needs to be tested before it is dereferenced, and forgetting that test leads to unexpected exceptions. It is an elegant design that L[a:b] can also represent sequences of length 0.
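
A small sketch of why this matters. The functions `items_between` and `total_length` are hypothetical names invented for this example. Because an empty range comes back as an empty list rather than None, the caller needs no special case.

```python
def items_between(seq, a, b):
    """Return seq[a:b]; an empty range yields an empty list, never None."""
    return seq[a:b]

def total_length(words):
    """Sum the lengths of the words; works unchanged for an empty input."""
    total = 0
    for w in words:        # the loop body simply never runs when words is empty
        total += len(w)
    return total

words = ["half", "open", "range"]
nothing = items_between(words, 2, 2)   # L[a:a] is empty for any a
```

Had `items_between` returned None for the empty case, `total_length(nothing)` would raise a TypeError unless every caller remembered to test for it first.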

C++'s STL also chooses this notation to represent a range. According to the literature this is crucial because "algorithms that operate on n things frequently require n+1 positions. Linear search, for example (find) must be able to return some value to indicate that the search was unsuccessful." I have seen so many people flunk linked list or data structure exercises because they have trouble dealing with the end of a list. Often a good solution is to shift the focus from the n concrete objects to the n+1 positions around them. I hope this helps to make sense of the half open notation.
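
The n+1 positions idea can be sketched in Python as well. The `find` function below is a hypothetical helper written for this example, loosely modelled on how C++'s `std::find` returns the end of the range when nothing matches. It is not Python's own `list.index`, which raises an exception instead.

```python
def find(seq, target, start=0, end=None):
    """Return the first index in [start, end) holding target, or end if absent."""
    if end is None:
        end = len(seq)
    for i in range(start, end):
        if seq[i] == target:
            return i
    return end   # the one-past-the-end position signals "not found"

data = [10, 20, 30]     # n = 3 items, so 4 possible positions: 0, 1, 2, 3
hit = find(data, 20)    # a position that holds an item
miss = find(data, 99)   # position 3, one past the last item
tail = data[miss:]      # still a valid slice, and it is empty
```

The "not found" result needs no separate sentinel: it is just the extra position at the end, and it still works as a slice boundary.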

2005.06.16


