DocumentCode :
2669287
Title :
Research on Some Key Technologies of Tibetan Automatic Word Segmentation
Author :
Sun, Yuan ; Yan, Xiaodong ; Zhao, Xiaobing ; Yang, Guosheng
Author_Institution :
Sch. of Inf. Eng., Minzu Univ. of China, Beijing, China
fYear :
2011
fDate :
1-3 Nov. 2011
Firstpage :
188
Lastpage :
191
Abstract :
This paper researches on some key technologies of Tibetan automatic word segmentation. We propose a Tibetan automatic word segmentation approach, which is taking the advantage of case-auxiliary words and continuous feature. Meanwhile, a resolution method of overlapping ambiguity in Tibetan word segmentation is proposed, which is based on forward-backward scanning identification method and improved maximum probability algorithm. Finally, an experiment is conducted, and the results prove the algorithm is effective.
Keywords :
natural language processing; probability; word processing; Tibetan automatic word segmentation; case-auxiliary word; forward-backward scanning identification method; maximum probability algorithm; word segmentation ambiguity; Accuracy; Dictionaries; Educational institutions; Feature extraction; Grammar; Information processing; Text processing; Tibetan word segmentation; case-auxiliary words; continuous features; overlapping ambiguity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Networks and Intelligent Systems (ICINIS), 2011 4th International Conference on
Conference_Location :
Kunming
Print_ISBN :
978-1-4577-1626-3
Type :
conf
DOI :
10.1109/ICINIS.2011.43
Filename :
6104725
Link To Document :
بازگشت