Title : 
Discussion on Collation of Tibetan Syllable
         
        
            Author : 
Huang, Heming ; Da, Feipeng
         
        
            Author_Institution : 
Sch. of Autom., Southeast Univ., Nanjing, China
         
        
        
        
        
        
            Abstract : 
Based on the general syllable structure, a syllable´s component letters should be expanded orderly into the series of basic consonant, prefix consonant, head consonant... and the second suffix consonant. If there is no letter in a syllable´s particular position, a special character, whose collation element is less than that of any Tibetan letter, should be used in the corresponding position of the expanded series. Thus, we have a character series that is canonically equivalent to the Tibetan syllable. Furthermore, a syllable´s collation element series could be developed by introducing each character´s collation element and each syllable could be collated correctly with its collation element series. However, for the sake of memory saving, a syllable´s collation element series is compressed with the Run-Length algorithm and the final compression ratio reaches 4:1.
         
        
            Keywords : 
natural language processing; Tibetan letter; Tibetan syllable collation; basic consonant; collation element; head consonant; prefix consonant; syllable component letters; syllable structure; Asia; Books; Dictionaries; Information processing; Shape; Transforms; Tibetan; collation; syllable; universal structure;
         
        
        
        
            Conference_Titel : 
Asian Language Processing (IALP), 2010 International Conference on
         
        
            Conference_Location : 
Harbin
         
        
            Print_ISBN : 
978-1-4244-9063-9
         
        
        
            DOI : 
10.1109/IALP.2010.27