DocumentCode
532797
Title
A pragmatic Chinese ord Segmentation
Author
Jun, Li ; Jun, Wen ; Xiaofeng, Wan
Author_Institution
Liupanshui Tobacco Corp., Liupanshui, China
Volume
14
fYear
2010
fDate
22-24 Oct. 2010
Abstract
Ord segmentation is the first step in Chinese information processing, and its performance has a great influence on the next processing steps. This paper presents a pragmatic approach to Chinese word segmentation. It applies the Maximum Matching algorithm and name entity word rules to achieve accurate Chinese word segmentation. The experiment proves that they have high accuracy in Chinese word process.
Keywords
information retrieval; natural language processing; text analysis; word processing; Chinese information processing; Chinese word process; maximum matching algorithm; named entity recognition; natural language texts; pragmatic Chinese word segmentation; Computational modeling; Context modeling; Statistical analysis; lexicon; named entity recognition (NER); word segmentation;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Application and System Modeling (ICCASM), 2010 International Conference on
Conference_Location
Taiyuan
Print_ISBN
978-1-4244-7235-2
Electronic_ISBN
978-1-4244-7237-6
Type
conf
DOI
10.1109/ICCASM.2010.5622340
Filename
5622340
Link To Document