• DocumentCode
    1508490
  • Title

    Genetic K-means algorithm

  • Author

    Krishna, K. ; Murty, M. Narasimha

  • Author_Institution
    Dept. of Electr. Eng., Indian Inst. of Sci., Bangalore, India
  • Volume
    29
  • Issue
    3
  • fYear
    1999
  • fDate
    6/1/1999 12:00:00 AM
  • Firstpage
    433
  • Lastpage
    439
  • Abstract
    In this paper, we propose a novel hybrid genetic algorithm (GA) that finds a globally optimal partition of a given data into a specified number of clusters. GA´s used earlier in clustering employ either an expensive crossover operator to generate valid child chromosomes from parent chromosomes or a costly fitness function or both. To circumvent these expensive operations, we hybridize GA with a classical gradient descent algorithm used in clustering, viz. K-means algorithm. Hence, the name genetic K-means algorithm (GKA). We define K-means operator, one-step of K-means algorithm, and use it in GKA as a search operator instead of crossover. We also define a biased mutation operator specific to clustering called distance-based-mutation. Using finite Markov chain theory, we prove that the GKA converges to the global optimum. It is observed in the simulations that GKA converges to the best known optimum corresponding to the given data in concurrence with the convergence result. It is also observed that GKA searches faster than some of the other evolutionary algorithms used for clustering
  • Keywords
    genetic algorithms; pattern clustering; unsupervised learning; K-means algorithm; clustering; distance-based-mutation; finite Markov chain theory; globally optimal partition; gradient descent algorithm; hybrid genetic algorithm; Biological cells; Clustering algorithms; Convergence; Data analysis; Evolutionary computation; Genetic algorithms; Genetic mutations; Iterative algorithms; Partitioning algorithms; Pattern analysis;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/3477.764879
  • Filename
    764879