Title :
Text Clustering via Particle Swarm Optimization
Author :
Lu, Yanping ; Wang, Shengrui ; Li, Shaozi ; Zhou, Changle
Author_Institution :
Dept. of Comput., Univ. of Sherbrooke, Sherbrooke, QC
fDate :
March 30 2009-April 2 2009
Abstract :
This paper presents an approach which extends a particle swarm optimizer for variable weighting (PSOVW) to handle the problem of text clustering, called text clustering via particle swarm optimization (TCPSO). PSOVW has been exploited for evolving optimal feature weights for clusters and has demonstrated to improve the clustering quality of high-dimensional data. However, when applying it for text clustering, there exist some modifications such as the similarity measure, parameter selection and the criterion function. Our experimental results on both four structured text datasets built from 20 newsgroups as well as four large-scale text datasets selected from CLUTO show that the proposed algorithm is able to greatly improve the quality of text clustering compared to four typical clustering algorithms and one competitive subspace clustering method.
Keywords :
data structures; particle swarm optimisation; pattern clustering; high-dimensional data; parameter selection; particle swarm optimization; structured text datasets; subspace clustering method; text clustering; variable weighting; Circuits; Clustering algorithms; Clustering methods; Frequency; Large-scale systems; Merging; Optimization methods; Particle swarm optimization; Partitioning algorithms; Text mining;
Conference_Titel :
Swarm Intelligence Symposium, 2009. SIS '09. IEEE
Conference_Location :
Nashville, TN
Print_ISBN :
978-1-4244-2762-8
DOI :
10.1109/SIS.2009.4937843