DocumentCode
3513399
Title
Study on Association Rules Mining Based Chinese Text Representation
Author
Li, Fang ; Zhu, Qunxiong
Author_Institution
Sch. of Comput. Sci. & Technol., Beijing Univ. of Chem. Technol., Beijing
fYear
2008
fDate
1-3 Nov. 2008
Firstpage
725
Lastpage
728
Abstract
In this paper, the problem of text representation in the process of text mining is mainly discussed. The paper focuses on how to simplify the text model in advance of the construction of term-by-document matrix. By using association rules mining method to find the highly correlative words to form words-set, the vocabulary set is decreased effectively, which leads to the text modelpsilas simplification directly. During this process, some incremental update problems of text representation are also introduced. In the end, a simulation case validate that the method is not only efficient but also helpful to the further text clustering.
Keywords
data mining; matrix algebra; natural language processing; text analysis; Chinese text representation; association rules mining; term-by-document matrix; vocabulary set; Association rules; Chemical technology; Computer science; Data mining; Data structures; Intelligent networks; Intelligent systems; Principal component analysis; Text mining; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Networks and Intelligent Systems, 2008. ICINIS '08. First International Conference on
Conference_Location
Wuhan
Print_ISBN
978-0-7695-3391-9
Electronic_ISBN
978-0-7695-3391-9
Type
conf
DOI
10.1109/ICINIS.2008.117
Filename
4683327
Link To Document