• DocumentCode
    507192
  • Title

    An Improved Chinese Segmentation Algorithm Based on Segmentation Dictionary

  • Author

    Niu, Yan ; Li, Lala

  • Author_Institution
    Comput. Sch., Hubei Univ. of Technol., Wuhan, China
  • Volume
    1
  • fYear
    2009
  • fDate
    13-15 Nov. 2009
  • Firstpage
    184
  • Lastpage
    187
  • Abstract
    Based on the analysis of the traditional forward maximum matching word segmentation algorithm and the characteristics of the principle on the basis of the results of the use of word frequency statistics, we design a new structure of the dictionary, a dictionary based on the new structure to improve the matching algorithm are the largest. After time complexity analysis and experiments, the improved forward maximum matching algorithm can further improve the efficiency of segmentation.
  • Keywords
    dictionaries; linguistics; natural language processing; Chinese segmentation algorithm; forward maximum matching word segmentation algorithm; segmentation dictionary; time complexity analysis; word frequency statistics; Algorithm design and analysis; Dictionaries; Frequency; Handicapped aids; Information analysis; Information processing; Machine assisted indexing; Natural languages; Statistical analysis; White spaces; Chinese information processing; Chinese word segmentation; FMM algorithm; two-word root;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Technology and Development, 2009. ICCTD '09. International Conference on
  • Conference_Location
    Kota Kinabalu
  • Print_ISBN
    978-0-7695-3892-1
  • Type

    conf

  • DOI
    10.1109/ICCTD.2009.125
  • Filename
    5359795