DocumentCode :
2396982
Title :
Text clustering ensemble based on genetic algorithms
Author :
Mao-ting Gao ; Bing-jing Wang
Author_Institution :
Coll. of Inf. Eng., Shanghai Maritime Univ., Shanghai, China
fYear :
2012
fDate :
19-20 May 2012
Firstpage :
2329
Lastpage :
2332
Abstract :
Text feature is usually expressed as a matrix of huge dimensionality in text mining, and common clustering algorithm are not stable and cannot obtain clustering solution efficiently. Latent Semantic Analysis can reduce dimensionality effectively, and emerges the semantic relations between texts and terms. Clustering ensemble can get better clustering solution than single clustering method. A text clustering ensemble based on genetic algorithms is presented, which combines Latent Semantic Analysis and Clustering ensemble based on genetic algorithms. Experiments have demonstrated that text clustering ensemble based on genetic algorithms can effectively improve the clustering performance.
Keywords :
data mining; genetic algorithms; natural language processing; pattern clustering; text analysis; dimensionality reduction; genetic algorithms; huge dimensionality; latent semantic analysis; text clustering ensemble; text feature; text mining; Accuracy; Algorithm design and analysis; Biological cells; Clustering algorithms; Genetic algorithms; Matrix decomposition; Semantics; clustering ensemble; genetic algorithm; latent semantic analysis; text clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems and Informatics (ICSAI), 2012 International Conference on
Conference_Location :
Yantai
Print_ISBN :
978-1-4673-0198-5
Type :
conf
DOI :
10.1109/ICSAI.2012.6223521
Filename :
6223521
Link To Document :
بازگشت