• DocumentCode
    2244399
  • Title

    A rule-based method for commas´ disambiguation in Chinese patent text

  • Author

    Qianqian Song ; Yun Zhu ; Lixia Wang ; Yaohong Jin

  • Author_Institution
    Inst. of Chinese Inf. Process., Beijing Normal Univ., Beijing, China
  • fYear
    2012
  • fDate
    Oct. 30 2012-Nov. 1 2012
  • Firstpage
    1506
  • Lastpage
    1510
  • Abstract
    We described a rule-based method for disambiguating Chinese commas in patent text which will be beneficial to the work on Chinese-English Patent MT. We annotated ten thousand sentences of patent text and made a number of rules according to the annotated results. Experiments were conducted on 5 intact patent documents containing 1219 commas and our model achieves an accuracy of over 90% overall.
  • Keywords
    data mining; information retrieval; language translation; natural language processing; text analysis; Chinese comma disambiguation; Chinese patent text; Chinese-English patent MT; patent documents; rule-based method; text annotation; Accuracy; Cloud computing; Educational institutions; Natural language processing; Patents; Semantics; Syntactics; Chinese patent text; MT; Rule-based method; commas´ disambiguation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud Computing and Intelligent Systems (CCIS), 2012 IEEE 2nd International Conference on
  • Conference_Location
    Hangzhou
  • Print_ISBN
    978-1-4673-1855-6
  • Type

    conf

  • DOI
    10.1109/CCIS.2012.6664636
  • Filename
    6664636