• DocumentCode
    2920758
  • Title

    Application of Chinese Word Segmentation Based on Linguistic Environment Analysis in Text Information Filtering System

  • Author

    Yi, Zhi-an ; Lv, Jia

  • Author_Institution
    Coll. of Comput. & Inf. Technol., Daqing Pet. Inst., Daqing
  • fYear
    2009
  • fDate
    20-22 Feb. 2009
  • Firstpage
    467
  • Lastpage
    470
  • Abstract
    This paper provides Chinese word segmentation based on language analysis problem in text information filtering system. The improved Chinese word segmentation is made of a bigram segmentation and a segmentation correction, new words recognition and disambiguation through the bigram segmentation, check the accuracy of segmentation results using the segmentation correction from the perspective of syntax. It has been proved by experiments that the segmentation not only strengthen the system´s language analysis ability, but also improve the accuracy of text information filtering system when the improved Chinese word segmentation was applied to the text analysis module.
  • Keywords
    information filtering; natural language processing; text analysis; word processing; Chinese word segmentation; bigram segmentation; language analysis problem; linguistic environment analysis; segmentation correction; syntax; text information filtering system; words recognition; Computer applications; Educational institutions; Information analysis; Information filtering; Information science; Information technology; Internet; Natural languages; Petroleum; Text analysis; Chinese word segmentation; Information filtration; disambiguation; segmentation correction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electronic Computer Technology, 2009 International Conference on
  • Conference_Location
    Macau
  • Print_ISBN
    978-0-7695-3559-3
  • Type

    conf

  • DOI
    10.1109/ICECT.2009.89
  • Filename
    4796006