• DocumentCode
    2397435
  • Title

    Study of segment dictionary based on two-dimensional array

  • Author

    Li, Chengcheng ; Wu, Hong

  • Author_Institution
    Sch. of Comput. & Inf. Eng., Inner Mongolia Normal Univ., Hohhot, China
  • fYear
    2010
  • fDate
    26-28 Oct. 2010
  • Firstpage
    674
  • Lastpage
    676
  • Abstract
    Automatic Chinese word segmentation is the foundation of Chinese information processing and is widely applied in many fields. This paper presents a new dictionary mechanism. Because one-character and two-character words occur with high frequency in Chinese, we propose building an index table keyed on the first two characters of each word, with the index table implemented as a two-dimensional array. The algorithm locates data directly by establishing a correspondence between the internal codes of the first two Chinese characters and positions in the array. In this way, two-character words can be found directly in the two-dimensional array. This approach can significantly reduce the number of queries and thereby speed up segmentation.
  • Keywords
    dictionaries; query processing; word processing; Chinese automatic word segmentation; Chinese information processing; index table; segment dictionary; two-dimensional array; Dictionary Mechanism; Segmentation Dictionary
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Broadband Network and Multimedia Technology (IC-BNMT), 2010 3rd IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-6769-3
  • Type

    conf

  • DOI
    10.1109/ICBNMT.2010.5705175
  • Filename
    5705175
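
The indexing idea in the abstract can be illustrated with a minimal sketch. The abstract does not name the internal code or describe what each array cell holds, so everything here is an assumption: GB2312 is used as the internal code, each cell stores a flag for the two-character word plus the tails of longer words, rows are allocated lazily to save memory, and the names `char_index` and `TwoCharIndex` are hypothetical. It is not the authors' implementation.

```python
# Sketch only: GB2312 is an ASSUMED internal code; the abstract does not name one.
GB_HI_MIN, GB_HI_MAX = 0xB0, 0xF7          # lead-byte range of GB2312 hanzi
GB_LO_MIN, GB_LO_MAX = 0xA1, 0xFE          # trail-byte range of GB2312 hanzi
ROW = GB_LO_MAX - GB_LO_MIN + 1            # 94 code points per lead byte
SIZE = (GB_HI_MAX - GB_HI_MIN + 1) * ROW   # 6768 slots per array dimension


def char_index(ch):
    """Map one hanzi to an array offset, or None if outside the assumed range."""
    try:
        raw = ch.encode("gb2312")
    except UnicodeEncodeError:
        return None
    if len(raw) != 2:
        return None
    hi, lo = raw
    if not (GB_HI_MIN <= hi <= GB_HI_MAX and GB_LO_MIN <= lo <= GB_LO_MAX):
        return None
    return (hi - GB_HI_MIN) * ROW + (lo - GB_LO_MIN)


class TwoCharIndex:
    """Hypothetical dictionary indexed by the first two characters of a word."""

    def __init__(self):
        # Two-dimensional array; each row is created on first use.
        self.rows = [None] * SIZE

    def _cell(self, word, create=False):
        if len(word) < 2:
            return None
        i, j = char_index(word[0]), char_index(word[1])
        if i is None or j is None:
            return None
        if self.rows[i] is None:
            if not create:
                return None
            self.rows[i] = [None] * SIZE
        if self.rows[i][j] is None and create:
            self.rows[i][j] = {"is_word": False, "tails": set()}
        return self.rows[i][j]

    def add(self, word):
        cell = self._cell(word, create=True)
        if cell is None:
            raise ValueError("word must start with two characters in the assumed range")
        if len(word) == 2:
            cell["is_word"] = True        # two-character word: the cell itself answers
        else:
            cell["tails"].add(word[2:])   # longer word: keep the remainder in the cell

    def contains(self, word):
        cell = self._cell(word)
        if cell is None:
            return False
        if len(word) == 2:
            return cell["is_word"]        # a single array access, no further search
        return word[2:] in cell["tails"]
```

Under these assumptions, looking up a two-character word costs two code-to-offset conversions and one array access, which is the kind of reduction in query steps the abstract describes. Single-character words are left out of the sketch.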