DocumentCode
1240181
Title
A weighted sum validity function for clustering with a hybrid niching genetic algorithm
Author
Sheng, Weiguo ; Swift, Stephen ; Zhang, Leishi ; Liu, Xiaohui
Author_Institution
Dept. of Inf. Syst. & Comput., Brunel Univ., London, UK
Volume
35
Issue
6
fYear
2005
Firstpage
1156
Lastpage
1167
Abstract
Clustering is inherently a difficult problem, both with respect to the construction of adequate objective functions as well as to the optimization of the objective functions. In this paper, we suggest an objective function called the Weighted Sum Validity Function (WSVF), which is a weighted sum of the several normalized cluster validity functions. Further, we propose a Hybrid Niching Genetic Algorithm (HNGA), which can be used for the optimization of the WSVF to automatically evolve the proper number of clusters as well as appropriate partitioning of the data set. Within the HNGA, a niching method is developed to preserve both the diversity of the population with respect to the number of clusters encoded in the individuals and the diversity of the subpopulation with the same number of clusters during the search. In addition, we hybridize the niching method with the k-means algorithm. In the experiments, we show the effectiveness of both the HNGA and the WSVF. In comparison with other related genetic clustering algorithms, the HNGA can consistently and efficiently converge to the best known optimum corresponding to the given data in concurrence with the convergence result. The WSVF is found generally able to improve the confidence of clustering solutions and achieve more accurate and robust results.
Keywords
genetic algorithms; pattern clustering; WSVF objective function; evolutionary computation; genetic clustering algorithms; hybrid niching genetic algorithm; k-means algorithm; normalized cluster validity functions; optimization; weighted sum validity function; Clustering algorithms; Clustering methods; Evolutionary computation; Genetic algorithms; Heuristic algorithms; Information systems; Organizing; Partitioning algorithms; Robustness; Shape; Cluster validity; clustering; evolutionary computation; genetic algorithms; niching methods; Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Information Storage and Retrieval; Models, Genetic; Models, Statistical; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity;
fLanguage
English
Journal_Title
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
1083-4419
Type
jour
DOI
10.1109/TSMCB.2005.850173
Filename
1542262
Link To Document