• DocumentCode
    3513399
  • Title

    Study on Association Rules Mining Based Chinese Text Representation

  • Author

    Li, Fang ; Zhu, Qunxiong

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Beijing Univ. of Chem. Technol., Beijing
  • fYear
    2008
  • fDate
    1-3 Nov. 2008
  • Firstpage
    725
  • Lastpage
    728
  • Abstract
    In this paper, the problem of text representation in the process of text mining is mainly discussed. The paper focuses on how to simplify the text model in advance of the construction of term-by-document matrix. By using association rules mining method to find the highly correlative words to form words-set, the vocabulary set is decreased effectively, which leads to the text modelpsilas simplification directly. During this process, some incremental update problems of text representation are also introduced. In the end, a simulation case validate that the method is not only efficient but also helpful to the further text clustering.
  • Keywords
    data mining; matrix algebra; natural language processing; text analysis; Chinese text representation; association rules mining; term-by-document matrix; vocabulary set; Association rules; Chemical technology; Computer science; Data mining; Data structures; Intelligent networks; Intelligent systems; Principal component analysis; Text mining; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Networks and Intelligent Systems, 2008. ICINIS '08. First International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-0-7695-3391-9
  • Electronic_ISBN
    978-0-7695-3391-9
  • Type

    conf

  • DOI
    10.1109/ICINIS.2008.117
  • Filename
    4683327