DocumentCode :
3513399
Title :
Study on Association Rules Mining Based Chinese Text Representation
Author :
Li, Fang ; Zhu, Qunxiong
Author_Institution :
Sch. of Comput. Sci. & Technol., Beijing Univ. of Chem. Technol., Beijing
fYear :
2008
fDate :
1-3 Nov. 2008
Firstpage :
725
Lastpage :
728
Abstract :
In this paper, the problem of text representation in the process of text mining is mainly discussed. The paper focuses on how to simplify the text model in advance of the construction of term-by-document matrix. By using association rules mining method to find the highly correlative words to form words-set, the vocabulary set is decreased effectively, which leads to the text modelpsilas simplification directly. During this process, some incremental update problems of text representation are also introduced. In the end, a simulation case validate that the method is not only efficient but also helpful to the further text clustering.
Keywords :
data mining; matrix algebra; natural language processing; text analysis; Chinese text representation; association rules mining; term-by-document matrix; vocabulary set; Association rules; Chemical technology; Computer science; Data mining; Data structures; Intelligent networks; Intelligent systems; Principal component analysis; Text mining; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Networks and Intelligent Systems, 2008. ICINIS '08. First International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-0-7695-3391-9
Electronic_ISBN :
978-0-7695-3391-9
Type :
conf
DOI :
10.1109/ICINIS.2008.117
Filename :
4683327
Link To Document :
بازگشت