tungwaiyip.info


Jyutping (Cantonese Pronunciation) Table

Initially my blog focused on technical IT subjects. Over time I have written less about those and more about general topics like travel. Today I return to a technical topic: the Jyutping (粵拼) Cantonese romanization system.

It started with my quest to learn and master a Chinese input method. Years ago I began with the Cangjie (倉頡) system. I never got much beyond the basics because it is a difficult system to learn, let alone master. Then I looked at pronunciation-based systems. I was glad to find that Cantonese-based systems are readily available. Out of the multiple romanization systems, people seem to have gravitated toward the Jyutping system from the Linguistic Society of Hong Kong.

So the next step is to get familiar with the Jyutping system, which is not trivial for me because I am weak in phonetics. It would be very useful to have a service that annotates a piece of Chinese text with the pronunciation under each character. Unfortunately I could find no such software, only some dictionaries that do it one character at a time. So I have decided to write one myself, as a naïve translation should not be difficult to write.
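To make the idea of a naïve translation concrete, here is a minimal sketch in Python. It looks up each character in a table and gives up on anything it does not know. The names `SAMPLE_JYUTPING`, `annotate` and `render` are made up for illustration, the table holds only four hand-entered characters, and the reading is printed in parentheses after each character rather than underneath it. It also ignores the fact that some characters have more than one reading depending on context, which is exactly what makes it naïve.

```python
# A tiny hand-entered sample table (hypothetical name); a real table
# would need an entry for every character.
SAMPLE_JYUTPING = {
    "粵": "jyut6",
    "拼": "ping3",
    "香": "hoeng1",
    "港": "gong2",
}


def annotate(text, table):
    """Pair every character with its reading, or None when it is unknown."""
    return [(ch, table.get(ch)) for ch in text]


def render(pairs):
    """Write each known character as 'char(reading)'; leave the rest bare."""
    return "".join(f"{ch}({reading})" if reading else ch for ch, reading in pairs)


if __name__ == "__main__":
    print(render(annotate("香港粵拼!", SAMPLE_JYUTPING)))
```

Characters missing from the table, such as punctuation or Latin letters, pass through unchanged, so mixed text stays readable.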

Now all I need is a table of all Chinese characters and their Jyutping, i.e. the Jyutping specification. I spent days searching the internet and came up empty-handed. The Linguistic Society of Hong Kong itself provides little more than a general description. It is a shame that even some links to its description are broken.

The good news is that I finally found it in the Unicode Han Database (Unihan), a place I have come across many times without realizing that it holds the most comprehensive data on Chinese characters, including Jyutping and even Cangjie codes. With the database in hand, I am ready for business!
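As a rough sketch of how the table could be pulled out of the database, the Python below assumes the data is plain text with one tab-separated record per line (code point such as `U+7CB5`, field name, value), with `#` marking comment lines, and that the Cantonese readings sit in a field named `kCantonese`. The function name `parse_unihan` and the sample lines are illustrative only; check the layout against the file you actually download.

```python
def parse_unihan(lines, field="kCantonese"):
    """Build {character: [readings]} from Unihan-style text lines.

    Assumes each data line is 'U+XXXX<TAB>field<TAB>value' and that
    several readings in one value are separated by spaces.
    """
    table = {}
    for line in lines:
        line = line.rstrip("\r\n")
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        parts = line.split("\t")
        if len(parts) != 3 or parts[1] != field:
            continue  # not the field we are after
        codepoint, _, value = parts
        char = chr(int(codepoint[2:], 16))  # 'U+7CB5' -> the character itself
        table[char] = value.split()
    return table


# Hand-written sample lines in the assumed layout, not copied from the file.
SAMPLE_LINES = [
    "# a comment line",
    "U+7CB5\tkCantonese\tjyut6",
    "U+7CB5\tkMandarin\tyuè",
    "U+62FC\tkCantonese\tping3",
]

if __name__ == "__main__":
    print(parse_unihan(SAMPLE_LINES))
```

For a real run, the same function can be fed an open file, for example `parse_unihan(open(path, encoding="utf-8"))`, where `path` points at whichever extracted file carries the `kCantonese` field.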

(2010-02-17: Thanks, Helena, for the heads-up. The Unihan database format has since changed. The new download link is Unihan.zip. Some general descriptions, such as the Unicode NamesList File Format, are also available.)

2007.11.29