DocumentCode
2283620
Title
Research on the text clustering algorithm based on latent semantic analysis and optimization
Author
Chun-hong, Wang ; Li-Li, Nan ; Yao-Peng, Ren
Author_Institution
Comput. Sci. & Technol., Yuncheng Univ., Yuncheng, China
Volume
4
fYear
2011
fDate
10-12 June 2011
Firstpage
470
Lastpage
473
Abstract
Text clustering based on the Vector Space Model suffers from high-dimensional, sparse representations and cannot handle synonymy and polysemy. Meanwhile, the k-means clustering algorithm depends on the choice of initial cluster centers and requires the number of clusters to be fixed in advance. To address these problems, this paper proposes a text clustering algorithm based on Latent Semantic Analysis and optimization. The algorithm both overcomes the problems of the Vector Space Model and avoids the shortcomings of the k-means algorithm. Compared with the text clustering algorithm based on Latent Semantic Analysis and with the text clustering algorithm based on the Vector Space Model and optimization, the proposed algorithm is shown to improve the clustering results, raising both the precision ratio and the recall ratio.
Keywords
optimisation; pattern clustering; text analysis; vectors; global optimization algorithm; k-means clustering algorithm; latent semantic analysis; text clustering algorithm; vector space model; clustering optimization; text clustering
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer Science and Automation Engineering (CSAE), 2011 IEEE International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-8727-1
Type
conf
DOI
10.1109/CSAE.2011.5952891
Filename
5952891