Title :
Study on Association Rules Mining Based Chinese Text Representation
Author :
Li, Fang ; Zhu, Qunxiong
Author_Institution :
Sch. of Comput. Sci. & Technol., Beijing Univ. of Chem. Technol., Beijing
Abstract :
This paper addresses text representation in text mining, focusing on how to simplify the text model before the term-by-document matrix is constructed. Association rules mining is used to find highly correlated words and group them into word sets, which effectively reduces the vocabulary and directly simplifies the text model. Incremental update problems of the text representation are also discussed. Finally, a simulation case shows that the method is efficient and also helps subsequent text clustering.
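The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm. It assumes documents are already segmented into word lists, considers only word pairs, keeps a pair when its support and its confidence in both directions pass thresholds, and merges linked pairs into word sets before building a binary term-by-document matrix. The function names, the thresholds, and the rule that a word set counts as present when any member occurs are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def correlated_pairs(docs, min_support=0.5, min_conf=0.8):
    """Return word pairs passing support and two-way confidence thresholds."""
    n = len(docs)
    doc_sets = [set(d) for d in docs]
    # Document frequency of each word and of each co-occurring word pair.
    word_count = Counter(w for s in doc_sets for w in s)
    pair_count = Counter(
        pair for s in doc_sets for pair in combinations(sorted(s), 2)
    )
    pairs = []
    for (a, b), c in pair_count.items():
        support = c / n
        conf_ab = c / word_count[a]  # confidence of rule a -> b
        conf_ba = c / word_count[b]  # confidence of rule b -> a
        if support >= min_support and min(conf_ab, conf_ba) >= min_conf:
            pairs.append((a, b))
    return sorted(pairs)


def build_word_sets(pairs):
    """Merge linked pairs into word sets; map each word to its set label."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in pairs:
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[rb] = ra
    groups = {}
    for w in list(parent):
        groups.setdefault(find(w), set()).add(w)
    # Hypothetical labeling: join the member words with "+".
    return {w: "+".join(sorted(g)) for g in groups.values() for w in g}


def term_by_document(docs, mapping):
    """Binary matrix: rows are features (word sets or words), columns are documents."""
    features = sorted({mapping.get(w, w) for d in docs for w in d})
    index = {f: i for i, f in enumerate(features)}
    matrix = [[0] * len(docs) for _ in features]
    for j, d in enumerate(docs):
        for f in {mapping.get(w, w) for w in d}:
            matrix[index[f]][j] = 1
    return features, matrix
```

With four toy documents over five distinct words, two of which always co-occur with a partner, the matrix shrinks from five rows to three. In practice the tokens would be the output of a Chinese word segmenter, and a full frequent-itemset algorithm could replace the pairwise counting used here.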
Keywords :
data mining; matrix algebra; natural language processing; text analysis; Chinese text representation; association rules mining; term-by-document matrix; vocabulary set; Association rules; Chemical technology; Computer science; Data mining; Data structures; Intelligent networks; Intelligent systems; Principal component analysis; Text mining; Vocabulary;
Conference_Title :
Intelligent Networks and Intelligent Systems, 2008. ICINIS '08. First International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-0-7695-3391-9
Electronic_ISBN :
978-0-7695-3391-9
DOI :
10.1109/ICINIS.2008.117