DocumentCode :
3418581
Title :
Text Clustering via Particle Swarm Optimization
Author :
Lu, Yanping ; Wang, Shengrui ; Li, Shaozi ; Zhou, Changle
Author_Institution :
Dept. of Comput., Univ. of Sherbrooke, Sherbrooke, QC
fYear :
2009
fDate :
March 30 2009-April 2 2009
Firstpage :
45
Lastpage :
51
Abstract :
This paper presents an approach which extends a particle swarm optimizer for variable weighting (PSOVW) to handle the problem of text clustering, called text clustering via particle swarm optimization (TCPSO). PSOVW has been exploited for evolving optimal feature weights for clusters and has demonstrated to improve the clustering quality of high-dimensional data. However, when applying it for text clustering, there exist some modifications such as the similarity measure, parameter selection and the criterion function. Our experimental results on both four structured text datasets built from 20 newsgroups as well as four large-scale text datasets selected from CLUTO show that the proposed algorithm is able to greatly improve the quality of text clustering compared to four typical clustering algorithms and one competitive subspace clustering method.
Keywords :
data structures; particle swarm optimisation; pattern clustering; high-dimensional data; parameter selection; particle swarm optimization; structured text datasets; subspace clustering method; text clustering; variable weighting; Circuits; Clustering algorithms; Clustering methods; Frequency; Large-scale systems; Merging; Optimization methods; Particle swarm optimization; Partitioning algorithms; Text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Swarm Intelligence Symposium, 2009. SIS '09. IEEE
Conference_Location :
Nashville, TN
Print_ISBN :
978-1-4244-2762-8
Type :
conf
DOI :
10.1109/SIS.2009.4937843
Filename :
4937843
Link To Document :
بازگشت