• DocumentCode
    532797
  • Title

    A pragmatic Chinese ord Segmentation

  • Author

    Jun, Li ; Jun, Wen ; Xiaofeng, Wan

  • Author_Institution
    Liupanshui Tobacco Corp., Liupanshui, China
  • Volume
    14
  • fYear
    2010
  • fDate
    22-24 Oct. 2010
  • Abstract
    Ord segmentation is the first step in Chinese information processing, and its performance has a great influence on the next processing steps. This paper presents a pragmatic approach to Chinese word segmentation. It applies the Maximum Matching algorithm and name entity word rules to achieve accurate Chinese word segmentation. The experiment proves that they have high accuracy in Chinese word process.
  • Keywords
    information retrieval; natural language processing; text analysis; word processing; Chinese information processing; Chinese word process; maximum matching algorithm; named entity recognition; natural language texts; pragmatic Chinese word segmentation; Computational modeling; Context modeling; Statistical analysis; lexicon; named entity recognition (NER); word segmentation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Application and System Modeling (ICCASM), 2010 International Conference on
  • Conference_Location
    Taiyuan
  • Print_ISBN
    978-1-4244-7235-2
  • Electronic_ISBN
    978-1-4244-7237-6
  • Type

    conf

  • DOI
    10.1109/ICCASM.2010.5622340
  • Filename
    5622340