DocumentCode
3369819
Title
Improved GA-based text clustering algorithm
Author
Shi, Kansheng ; Li, Lemin ; He, Jie ; Zhang, Naitong ; Liu, Haitao ; Song, Wentao
Author_Institution
Shanghai Jiaotong Univ., Shanghai, China
fYear
2011
fDate
28-30 Oct. 2011
Firstpage
675
Lastpage
679
Abstract
The traditional K-means algorithm is sensitive to the initial points and easy to fall into local optimum. To avoid this kind of flaw, an improved GA-based text clustering algorithm CGHCM is proposed. The new algorithm is proven effective to avoid falling into local optimum and obtains better clustering results.
Keywords
genetic algorithms; pattern clustering; text analysis; unsupervised learning; GA-based text clustering algorithm; K-means algorithm; genetic algorithm; unsupervised machine learning method; Accuracy; Algorithm design and analysis; Biological cells; Clustering algorithms; Genetic algorithms; Mathematical model; Vectors; GA; K-means; Similarity measurement; VSM; text clustering;
fLanguage
English
Publisher
ieee
Conference_Titel
Broadband Network and Multimedia Technology (IC-BNMT), 2011 4th IEEE International Conference on
Conference_Location
Shenzhen
Print_ISBN
978-1-61284-158-8
Type
conf
DOI
10.1109/ICBNMT.2011.6156021
Filename
6156021
Link To Document