DocumentCode
3320691
Title
A Text Clustering Algorithm Combining K-Means and Local Search Mechanism
Author
Cheng, Lanlan ; Sun, Yueheng ; Wei, Jinghui
Author_Institution
Sch. of Comput. Sci. & Inf. Eng., Tianjin Univ. of Sci. & Technol., Tianjin, China
fYear
2009
fDate
28-29 Dec. 2009
Firstpage
53
Lastpage
56
Abstract
Text clustering is one of common techniques in mining large scales of document data. The paper presents an improved K-means text clustering algorithm in which a local search mechanism is introduced. By the iteration process of K-means algorithm, our approach can quickly get a local extreme point, and then use the search strategy of local search mechanism to have K-means jump out of that point and get a better solution. The experimental results show that our approach achieves better performance in the terms of entropy than the traditional algorithm while not slowing down the clustering speed.
Keywords
data mining; entropy; iterative methods; pattern clustering; search problems; text analysis; K-means text clustering algorithm; document data mining; entropy; iteration process; local search mechanism; Clustering algorithms; Computer science; Data engineering; Data mining; Electronic mail; Entropy; Iterative algorithms; Large-scale systems; Partitioning algorithms; Sun; K-Means; local search mechanism; text clustering;
fLanguage
English
Publisher
ieee
Conference_Titel
Research Challenges in Computer Science, 2009. ICRCCS '09. International Conference on
Conference_Location
Shanghai
Print_ISBN
978-0-7695-3927-0
Electronic_ISBN
978-1-4244-5410-5
Type
conf
DOI
10.1109/ICRCCS.2009.21
Filename
5401292
Link To Document