• DocumentCode
    1240181
  • Title

    A weighted sum validity function for clustering with a hybrid niching genetic algorithm

  • Author

    Sheng, Weiguo ; Swift, Stephen ; Zhang, Leishi ; Liu, Xiaohui

  • Author_Institution
    Dept. of Inf. Syst. & Comput., Brunel Univ., London, UK
  • Volume
    35
  • Issue
    6
  • fYear
    2005
  • Firstpage
    1156
  • Lastpage
    1167
  • Abstract
    Clustering is inherently a difficult problem, both with respect to the construction of adequate objective functions as well as to the optimization of the objective functions. In this paper, we suggest an objective function called the Weighted Sum Validity Function (WSVF), which is a weighted sum of the several normalized cluster validity functions. Further, we propose a Hybrid Niching Genetic Algorithm (HNGA), which can be used for the optimization of the WSVF to automatically evolve the proper number of clusters as well as appropriate partitioning of the data set. Within the HNGA, a niching method is developed to preserve both the diversity of the population with respect to the number of clusters encoded in the individuals and the diversity of the subpopulation with the same number of clusters during the search. In addition, we hybridize the niching method with the k-means algorithm. In the experiments, we show the effectiveness of both the HNGA and the WSVF. In comparison with other related genetic clustering algorithms, the HNGA can consistently and efficiently converge to the best known optimum corresponding to the given data in concurrence with the convergence result. The WSVF is found generally able to improve the confidence of clustering solutions and achieve more accurate and robust results.
  • Keywords
    genetic algorithms; pattern clustering; WSVF objective function; evolutionary computation; genetic clustering algorithms; hybrid niching genetic algorithm; k-means algorithm; normalized cluster validity functions; optimization; weighted sum validity function; Clustering algorithms; Clustering methods; Evolutionary computation; Genetic algorithms; Heuristic algorithms; Information systems; Organizing; Partitioning algorithms; Robustness; Shape; Cluster validity; clustering; evolutionary computation; genetic algorithms; niching methods; Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Information Storage and Retrieval; Models, Genetic; Models, Statistical; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/TSMCB.2005.850173
  • Filename
    1542262