Title :
Discussion on Collation of Tibetan Syllable
Author :
Huang, Heming ; Da, Feipeng
Author_Institution :
Sch. of Autom., Southeast Univ., Nanjing, China
Abstract :
Based on the general syllable structure, the component letters of a Tibetan syllable should be expanded, in a fixed order, into the series basic consonant, prefix consonant, head consonant, ..., second suffix consonant. If a syllable has no letter in a particular position, a special character whose collation element is less than that of any Tibetan letter is placed in the corresponding position of the expanded series. The result is a character series that is canonically equivalent to the Tibetan syllable. Substituting each character's collation element then yields the syllable's collation element series, with which syllables can be collated correctly. To save memory, each syllable's collation element series is compressed with run-length encoding, and the final compression ratio reaches 4:1.
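The expansion and compression steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The slot names `middle_1` and `middle_2` are hypothetical placeholders for the positions the abstract elides, the letters are Latin stand-ins, and the numeric weights and the `FILLER` value are invented for illustration only.

```python
from itertools import groupby

# Slot order: "basic", "prefix", "head" and "second_suffix" are named in the
# abstract; "middle_1" and "middle_2" are hypothetical placeholders for the
# positions the abstract elides.
SLOTS = ("basic", "prefix", "head", "middle_1", "middle_2", "second_suffix")

# Hypothetical weight for an empty slot; it must be below every letter weight.
FILLER = 0

# Hypothetical collation weights for stand-in letters (not real Tibetan data).
WEIGHTS = {"k": 1, "g": 3, "d": 11, "b": 15, "s": 28}


def collation_series(syllable):
    """Expand a {slot: letter} mapping into a fixed-length weight series."""
    return [WEIGHTS[syllable[s]] if s in syllable else FILLER for s in SLOTS]


def rle_encode(series):
    """Run-length encode a series as (value, run_length) pairs."""
    return [(value, len(list(run))) for value, run in groupby(series)]


def rle_decode(pairs):
    """Invert rle_encode, restoring the fixed-length series."""
    return [value for value, count in pairs for _ in range(count)]


# Three stand-in syllables, given as {slot: letter} mappings.
a = {"basic": "k"}
b = {"basic": "k", "second_suffix": "s"}
c = {"basic": "g", "prefix": "b"}

# Sorting by the expanded series compares slot by slot; an empty slot
# sorts before any letter because FILLER is the smallest weight.
ordered = sorted([c, b, a], key=collation_series)

packed = rle_encode(collation_series(a))
```

In this sketch the run-length pairs are decoded before any comparison, because comparing the `(value, run_length)` pairs directly would not generally preserve the order of the expanded series. The compression obtained here depends entirely on the invented data and says nothing about the 4:1 ratio reported in the abstract.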
Keywords :
natural language processing; Tibetan letter; Tibetan syllable collation; basic consonant; collation element; head consonant; prefix consonant; syllable component letters; syllable structure; Asia; Books; Dictionaries; Information processing; Shape; Transforms; Tibetan; collation; syllable; universal structure
Conference_Title :
Asian Language Processing (IALP), 2010 International Conference on
Conference_Location :
Harbin
Print_ISBN :
978-1-4244-9063-9
DOI :
10.1109/IALP.2010.27