Title :
A pragmatic Chinese word segmentation
Author :
Jun, Li ; Jun, Wen ; Xiaofeng, Wan
Author_Institution :
Liupanshui Tobacco Corp., Liupanshui, China
Abstract :
Word segmentation is the first step in Chinese information processing, and its performance strongly influences the subsequent processing steps. This paper presents a pragmatic approach to Chinese word segmentation that combines the Maximum Matching algorithm with named-entity word rules. Experiments indicate that the combination achieves high accuracy in Chinese word processing.
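As an illustration of the Maximum Matching idea named in the abstract, the following is a minimal sketch of the textbook forward variant. It is not the authors' implementation, and their named-entity rules are not described in this record. The function name `forward_max_match`, the `max_len` parameter, and the toy lexicon are assumptions made for the example.

```python
def forward_max_match(text, lexicon, max_len=None):
    """Greedy forward maximum matching (illustrative sketch, not the paper's code).

    At each position, take the longest lexicon entry that starts there;
    if no entry matches, emit a single character and move on.
    """
    if max_len is None:
        # Longest lexicon entry bounds the window; fall back to 1 for an empty lexicon.
        max_len = max((len(word) for word in lexicon), default=1)

    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking one character at a time.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in lexicon:
                tokens.append(candidate)
                i += size
                break
    return tokens


# Hypothetical toy lexicon, chosen only to show the greedy behaviour.
toy_lexicon = {"研究", "研究生", "生命", "起源"}
print(forward_max_match("研究生命起源", toy_lexicon))
# → ['研究生', '命', '起源']
```

The output shows a known weakness of the greedy strategy: the longest match "研究生" is taken first, which leaves "命" stranded as a single character. Pure dictionary matching is commonly paired with additional rules for this reason, such as the named-entity rules the abstract mentions.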
Keywords :
information retrieval; natural language processing; text analysis; word processing; Chinese information processing; Chinese word process; maximum matching algorithm; named entity recognition; natural language texts; pragmatic Chinese word segmentation; Computational modeling; Context modeling; Statistical analysis; lexicon; named entity recognition (NER); word segmentation;
Conference_Title :
Computer Application and System Modeling (ICCASM), 2010 International Conference on
Conference_Location :
Taiyuan
Print_ISBN :
978-1-4244-7235-2
Electronic_ISBN :
978-1-4244-7237-6
DOI :
10.1109/ICCASM.2010.5622340