DocumentCode :
3583804
Title :
A disambiguate method to covering ambiguity based on the collocation information
Author :
Feng, Su-Qin ; Jiao, Li-juan
Author_Institution :
Dept. of Comput. Sci., XinZhou Teachers Univ., Xinzhou, China
Volume :
7
fYear :
2010
Firstpage :
3669
Lastpage :
3672
Abstract :
Covering ambiguity is a vital issue in Chinese word segmentation. The paper presents the disambiguation strategies based on the collocation information. Firstly, it gets the word that is combinatorial ambiguities from a larger scaled corpus, then counts up it´s collocation information. Lastly it uses multi maximal log algorithm for disambiguation. Further the paper uses disambiguated corpus to strengthen and stabilize collocations It is proved to be an easy and effective way in the experiments.
Keywords :
natural language processing; Chinese word segmentation; collocation information; combinatorial ambiguity; disambiguate method; multimaximal log algorithm; Accuracy; Algorithm design and analysis; Computers; Context; Heuristic algorithms; Sun; Training; Chinese word segmentation; Collocation information; Covering ambiguities; Disambiguate; multi maximal log;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Computation (ICNC), 2010 Sixth International Conference on
Print_ISBN :
978-1-4244-5958-2
Type :
conf
DOI :
10.1109/ICNC.2010.5583733
Filename :
5583733
Link To Document :
بازگشت