Title :
Chinese Chunking Based on Coarse-Grained Part-of-Speech Features
Author :
Sun, Guang-Lu ; Xue, Yibo ; Xu, Zhiming ; Lang, Fei
Author_Institution :
Res. Inst. of Inf. Technol., Tsinghua Univ., Beijing, China
Abstract :
Although part-of-speech (POS) is an effective feature for Chinese Chunking, the POS-tagging errors generated by automatic POS tagger leads to almost 10% performance drop in F-score. To solve this problem, this paper presents new features to replace the POS features, namely the coarse-grained part-of-speech features. Combining with the methods of processing out-of-vocabulary words, the new features are utilized in the Chinese chunking model. Experimental results show that the new features can contribute 2.71% performance improvement over the baseline method.
Keywords :
feature extraction; grammars; natural language processing; text analysis; Chinese chunking model; POS tagging errors; coarse grained part-of-speech features; out-of-vocabulary words; propagated errors; chunking; coarse-grained features; part-of-speech; propagated errors;
Conference_Titel :
Asian Language Processing, 2009. IALP '09. International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-0-7695-3904-1
DOI :
10.1109/IALP.2009.54