Title :
Resolution to Combinational Ambiguity of Chinese Word Segmentation
Author :
Liu, JiangYang ; Liu, Ying
Author_Institution :
Dept. of Chinese Language & Literature, Tsinghua Univ., Beijing, China
Abstract :
Chinese word segmentation ambiguity can be divided into two categories: overlapped ambiguity and combinational ambiguity. This paper only focuses on the resolution to combinational ambiguity of Chinese word segmentation. We select 36 typical combinational ambiguity strings, and make use of transformation-based learning methods to learn the rules of combinational ambiguity. Using these rules to test "People\´s Daily" Corpus of 1996, we find that the average precision rate is improved from 79.08% to 94.35%.
Keywords :
learning (artificial intelligence); word processing; Chinese word segmentation; combinational ambiguity; overlapped ambiguity; transformation-based learning methods; Electronic government; Electronic learning; Error analysis; Error correction; Information systems; Learning systems; Natural languages; Tagging; Testing; White spaces; combinational ambiguity; disambiguation; transformation-based learning;
Conference_Titel :
E-Learning, E-Business, Enterprise Information Systems, and E-Government, 2009. EEEE '09. International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-0-7695-3907-2
DOI :
10.1109/EEEE.2009.38