DocumentCode
2244399
Title
A rule-based method for commas' disambiguation in Chinese patent text
Author
Qianqian Song ; Yun Zhu ; Lixia Wang ; Yaohong Jin
Author_Institution
Inst. of Chinese Inf. Process., Beijing Normal Univ., Beijing, China
fYear
2012
fDate
Oct. 30 2012-Nov. 1 2012
Firstpage
1506
Lastpage
1510
Abstract
We describe a rule-based method for disambiguating Chinese commas in patent text, which will benefit work on Chinese-English patent machine translation (MT). We annotated ten thousand sentences of patent text and derived a number of rules from the annotated results. Experiments were conducted on 5 complete patent documents containing 1219 commas, and our model achieves an overall accuracy of over 90%.
Keywords
data mining; information retrieval; language translation; natural language processing; text analysis; Chinese comma disambiguation; Chinese patent text; Chinese-English patent MT; patent documents; rule-based method; text annotation; Accuracy; Cloud computing; Educational institutions; Natural language processing; Patents; Semantics; Syntactics; Chinese patent text; MT; Rule-based method; commas' disambiguation
fLanguage
English
Publisher
IEEE
Conference_Titel
Cloud Computing and Intelligent Systems (CCIS), 2012 IEEE 2nd International Conference on
Conference_Location
Hangzhou
Print_ISBN
978-1-4673-1855-6
Type
conf
DOI
10.1109/CCIS.2012.6664636
Filename
6664636