DocumentCode :
2283620
Title :
Research on the text clustering algorithm based on latent semantic analysis and optimization
Author :
Chun-hong, Wang ; Li-Li, Nan ; Yao-Peng, Ren
Author_Institution :
Comput. Sci. & Technol., Yun cheng Univ., Yun cheng, China
Volume :
4
fYear :
2011
fDate :
10-12 June 2011
Firstpage :
470
Lastpage :
473
Abstract :
The text clustering based on Vector Space Model has problems, such as high-dimensional and sparse, unable to solve synonym and polyseme etc. And meanwhile, k-means clustering algorithm has shortcomings, which depends on the initial clustering center and needs to fix the number of clusters in advance. Aiming at these problems, in this paper, a text clustering algorithm based on Latent Semantic Analysis and Optimization is proposed. This algorithm can not only overcome the problems of Vector Space Model, but also can avoid the shortcomings of k-means algorithm. And compared with the text clustering algorithm based on Latent Semantic Analysis and the text clustering algorithm based on Vector Space Model and optimization, our algorithm is proved which can preferably improve the effect of text clustering, and upgrade the precision ratio and recall ration of text.
Keywords :
optimisation; pattern clustering; text analysis; vectors; global optimization algorithm; k-means clustering algorithm; latent semantic analysis; text clustering algorithm; vector space model; Latent Semantic Analysis; Vector Space Model; clustering optimization; k-means clustering algorithm; text clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Automation Engineering (CSAE), 2011 IEEE International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-8727-1
Type :
conf
DOI :
10.1109/CSAE.2011.5952891
Filename :
5952891
Link To Document :
بازگشت