DocumentCode :
2244399
Title :
A rule-based method for commas' disambiguation in Chinese patent text
Author :
Qianqian Song ; Yun Zhu ; Lixia Wang ; Yaohong Jin
Author_Institution :
Inst. of Chinese Inf. Process., Beijing Normal Univ., Beijing, China
fYear :
2012
fDate :
Oct. 30 2012-Nov. 1 2012
Firstpage :
1506
Lastpage :
1510
Abstract :
We describe a rule-based method for disambiguating Chinese commas in patent text, which will benefit work on Chinese-English patent machine translation (MT). We annotated ten thousand sentences of patent text and derived a set of rules from the annotated results. Experiments on 5 intact patent documents containing 1219 commas show that our model achieves an overall accuracy of over 90%.
Keywords :
data mining; information retrieval; language translation; natural language processing; text analysis; Chinese comma disambiguation; Chinese patent text; Chinese-English patent MT; patent documents; rule-based method; text annotation; Accuracy; Cloud computing; Educational institutions; Natural language processing; Patents; Semantics; Syntactics; Chinese patent text; MT; Rule-based method; commas' disambiguation;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Cloud Computing and Intelligent Systems (CCIS), 2012 IEEE 2nd International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4673-1855-6
Type :
conf
DOI :
10.1109/CCIS.2012.6664636
Filename :
6664636