DocumentCode :
507192
Title :
An Improved Chinese Segmentation Algorithm Based on Segmentation Dictionary
Author :
Niu, Yan ; Li, Lala
Author_Institution :
Comput. Sch., Hubei Univ. of Technol., Wuhan, China
Volume :
1
fYear :
2009
fDate :
13-15 Nov. 2009
Firstpage :
184
Lastpage :
187
Abstract :
Based on the analysis of the traditional forward maximum matching word segmentation algorithm and the characteristics of the principle on the basis of the results of the use of word frequency statistics, we design a new structure of the dictionary, a dictionary based on the new structure to improve the matching algorithm are the largest. After time complexity analysis and experiments, the improved forward maximum matching algorithm can further improve the efficiency of segmentation.
Keywords :
dictionaries; linguistics; natural language processing; Chinese segmentation algorithm; forward maximum matching word segmentation algorithm; segmentation dictionary; time complexity analysis; word frequency statistics; Algorithm design and analysis; Dictionaries; Frequency; Handicapped aids; Information analysis; Information processing; Machine assisted indexing; Natural languages; Statistical analysis; White spaces; Chinese information processing; Chinese word segmentation; FMM algorithm; two-word root;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Technology and Development, 2009. ICCTD '09. International Conference on
Conference_Location :
Kota Kinabalu
Print_ISBN :
978-0-7695-3892-1
Type :
conf
DOI :
10.1109/ICCTD.2009.125
Filename :
5359795
Link To Document :
بازگشت