• DocumentCode
    172536
  • Title

    A rule-based method for Chinese punctuations processing in sentences segmentation

  • Author

    Jing Wang ; Yun Zhu ; Yaohong Jin

  • Author_Institution
    Inst. of Chinese Inf. Process., Beijing Normal Univ., Beijing, China
  • fYear
    2014
  • fDate
    20-22 Oct. 2014
  • Firstpage
    195
  • Lastpage
    198
  • Abstract
    In this paper, a rule-based sentence segmentation system is proposed. We studied the usage and function of Chinese punctuation marks, and classified them into 4 categories. According to whether punctuation can split a sentence, we tagged it with a label SST or un-SST. Experiments were conducted on 4 different kinds of corpus containing 12 kinds of Chinese punctuation marks, and our model achieves a high F-measure over 90% overall. Experiment results show that our approach is effectively for sentence segmentation.
  • Keywords
    knowledge based systems; natural language processing; Chinese punctuation marks; Chinese punctuations processing; F-measure; label SST; rule-based sentence segmentation system; sentences segmentation; unSST; Educational institutions; Information processing; Natural language processing; Patents; Presses; Semantics; Syntactics; Chinese Punctuation; Rule-Based Method; Sentence Segmentation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Asian Language Processing (IALP), 2014 International Conference on
  • Conference_Location
    Kuching
  • Type

    conf

  • DOI
    10.1109/IALP.2014.6973504
  • Filename
    6973504