Title :
Correlation clustering based on genetic algorithm for documents clustering
Author :
Zhang, Zhenya ; Cheng, Hongmei ; Chen, Wanli ; Zhang, Shuguang ; Fang, Qiansheng
Author_Institution :
Sch. of Electr. & Inf. Eng., Anhui Inst. of Archit. & Ind. (AIAI), Hefei
Abstract :
Correlation clustering problem is a NP hard problem and technologies for the solving of correlation clustering problem can be used to cluster given data set with relation matrix for data in the given data set. In this paper, an approach based on genetic algorithm for correlation clustering problem, named as GeneticCC, is presented. To estimate the performance of a clustering division, data correlation based clustering precision is defined and features of clustering precision are discussed in this paper. Experimental results show that the performance of clustering division for UCI document data set constructed by GeneticCC is better than clustering performance of other clustering divisions constructed by SOM neural network with clustering precision as criterion.
Keywords :
computational complexity; document handling; genetic algorithms; pattern clustering; self-organising feature maps; statistical analysis; GeneticCC; NP hard problem; SOM neural network; clustering division; correlation clustering; data correlation; documents clustering; genetic algorithm; Clustering algorithms; Communication industry; Computer architecture; Genetic algorithms; Laboratories; Multimedia computing; NP-hard problem; Neural networks; Partitioning algorithms; Text analysis;
Conference_Titel :
Evolutionary Computation, 2008. CEC 2008. (IEEE World Congress on Computational Intelligence). IEEE Congress on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-1822-0
Electronic_ISBN :
978-1-4244-1823-7
DOI :
10.1109/CEC.2008.4631230