tungwaiyip.info


Python's half open index notation

Beginning programmers often wonder about Python's sequence indexing and slicing notation. Array indexes start from 0. Slicing uses half-open notation, where L[a:b] is the subsequence of items whose index x satisfies a <= x < b.
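As a small illustration of that definition (the list contents and the values of a and b are arbitrary choices made for this sketch), the slice can be compared against an explicit filter on the index:

```python
L = ['a', 'b', 'c', 'd', 'e']
a, b = 1, 4

# The slice itself: starts at index a, stops just before index b.
sub = L[a:b]

# The same subsequence spelled out with the condition a <= x < b.
by_index = [L[x] for x in range(len(L)) if a <= x < b]

print(sub)              # ['b', 'c', 'd']
print(sub == by_index)  # True
```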

Why is the endpoint excluded? Wouldn't it be more intuitive if array indexes started from 1 and the endpoint were included, so that a 3-element array is referenced as L[1:3] with items L[1], L[2], L[3]?

It turns out this notation is an elegant and deliberate design and it has some excellent properties.

We write programs to operate on arrays: to find their length, traverse their subsequences, split them or join them. The half-open notation always shows a simple pattern. The inclusive notation, by contrast, requires adding 1 to or subtracting 1 from the indexes in many operations, which makes it more vulnerable to off-by-one errors. The article "One True Way of array indexing" discusses this at length. I have reproduced its example (with corrections) below:

| Operation | Half open | Inclusive |
|---|---|---|
| Length of a slice L[a:b] | b-a | b-a+1 |
| First n characters of L | L[:n] | L[1:n] |
| Last n characters of L | L[-n:] | L[len(L)-n+1:] |
| The identity slice | L == L[0:len(L)] == L[:] | L[1:len(L)] |
| The empty slice | L[a:a] is empty for any a | perhaps L[a:a-1]? |
| A slice of length n, from point a | L[a:a+n] | L[a:a+n-1] |
| Split L[a:b] at index c | L[a:b] == L[a:c]+L[c:b] | L[a:c-1]+L[c:b] |
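The half-open column can be checked directly in Python. The following is only a sketch: the sample list and the values of a, c, b and n are arbitrary, chosen so that every index stays within the list.

```python
L = list("half-open")   # 9 items
a, c, b, n = 2, 4, 7, 3

assert len(L[a:b]) == b - a            # length of a slice
assert L[:n] == L[0:n]                 # first n items
assert L[-n:] == L[len(L) - n:]        # last n items
assert L == L[0:len(L)] == L[:]        # the identity slice
assert L[a:a] == []                    # the empty slice
assert len(L[a:a + n]) == n            # a slice of length n, from point a
assert L[a:b] == L[a:c] + L[c:b]       # split at index c, nothing added or lost

print("all half-open identities hold")
```

One caveat worth knowing: L[-n:] gives the last n items only for n >= 1. With n == 0 it becomes L[0:], which is the whole list, so L[len(L)-n:] is the safer spelling when n may be zero.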

Another important property is that an empty sequence can be expressed as L[a:a], while there is no natural way to express an empty sequence in the inclusive notation. But do we really need to care about a special case? Absolutely! In fact, failure to account for empty input is one of the most common errors. Just as zero is a fundamental concept in mathematics, always think about how your program handles empty input. An inferior approach is to represent an empty sequence by None or a null pointer. This creates a special case, so that a variable needs to be tested before it is dereferenced. Failure to do so leads to unexpected exceptions. It is an elegant design that L[a:b] can also represent sequences of length 0.
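To make the contrast concrete, here is a minimal sketch. The function total_length and the sample data are hypothetical, made up for this illustration. The same code handles an empty slice without any extra test, whereas None has to be special-cased by the caller:

```python
def total_length(words):
    """Sum the lengths of all words; needs no special case for an empty list."""
    return sum(len(w) for w in words)

words = ["half", "open", "index"]

# An empty slice is still a real list, so it flows through unchanged.
empty = words[2:2]
print(total_length(words))   # 13
print(total_length(empty))   # 0

# None is not a sequence: forgetting to test for it raises an exception.
try:
    total_length(None)
    none_failed = False
except TypeError:
    none_failed = True
print(none_failed)           # True
```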

C++'s STL also chooses this notation to represent a range. According to the literature this is crucial because "algorithms that operate on n things frequently require n+1 positions. Linear search, for example (find) must be able to return some value to indicate that the search was unsuccessful." I have seen so many people flunk linked-list or data structure exercises because they have trouble dealing with the end of a list. Often a good solution is to shift the focus beyond the n concrete objects to the n+1 positions around them. I hope this helps to make sense of the half-open notation.
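The n+1 positions idea can be sketched in Python as well. The find function below is a hypothetical helper written for this illustration, loosely modeled on how the STL's find reports failure by returning the end position. It is not Python's built-in list.index, which raises ValueError instead.

```python
def find(seq, target):
    """Return the index of the first match, or len(seq) if there is none.

    A sequence of n items has n+1 valid return values: 0..n-1 for a hit,
    and n (one past the last item) for "not found".
    """
    for i, item in enumerate(seq):
        if item == target:
            return i
    return len(seq)

L = [3, 1, 4, 1, 5]

pos = find(L, 4)
print(pos, L[:pos], L[pos:])              # 2 [3, 1] [4, 1, 5]

missing = find(L, 9)
print(missing, L[:missing], L[missing:])  # 5 [3, 1, 4, 1, 5] []
```

Because the "not found" position is still a legal slice boundary, the caller can split the list at the result in both cases without testing for failure first.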

2005.06.16

 

 
