Title :
An evolutionary approach for the clustering data problem
Author :
Soares, Rodrigo G F ; Silva, Kelly P. ; Ludermir, Teresa B. ; De Carvalho, Francisco A T
Author_Institution :
Center of Inf., Fed. Univ. of Pernambuco, Recife
Abstract :
The clustering problem consists in the discovery of interesting groups in a dataset. Such task is very important and widely tackled in the literature. In this paper, we propose an evolutionary method in order to obtain well formed and spatially separated clusters. The proposed algorithm uses a complete solution representation, each partition is represented by a length-variable chromosome. The variation operators were chosen to facilitate the exchange of clustering information between individuals. We have put two complementary clustering criteria together in the fitness function, so that the method can find clusters with arbitrary shapes. The k-means algorithm was the basis of the local search operator, such operator might refine the clustering solutions. The population diversity was an important issue for the algorithm, so a diversity maintenance scheme was employed. Differently from other existing clustering algorithms, our algorithm does not need the setting of the number of clusters in advance. We evaluated the method in different contexts, using both real and simulated data.
Keywords :
evolutionary computation; pattern clustering; data clustering problem; diversity maintenance scheme; evolutionary method; fitness function; k-means algorithm; length-variable chromosome; Biological cells; Clustering algorithms; Context modeling; Data analysis; Evolutionary computation; Informatics; Machine learning; Machine learning algorithms; Partitioning algorithms; Shape;
Conference_Titel :
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-1820-6
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2008.4634064