• DocumentCode
    618207
  • Title

    A Multi-Objective Genetic Graph-Based Clustering algorithm with memory optimization

  • Author

    Menendez, Hector D. ; Barrero, David F. ; Camacho, David

  • Author_Institution
    Comput. Sci. Dept., Univ. Autonoma de Madrid, Madrid, Spain
  • fYear
    2013
  • fDate
    20-23 June 2013
  • Firstpage
    3174
  • Lastpage
    3181
  • Abstract
    Clustering is one of the most versatile tools for data analysis. Over the last few years, clustering that seeks the continuity of data (in opposition to classical centroid-based approaches) has attracted an increasing research interest. It is a challenging problem with a remarkable practical interest. The most popular continuity clustering method is the Spectral Clustering algorithm, which is based on graph cut: it initially generates a Similarity Graph using a distance measure and then uses its Graph Spectrum to find the best cut. Memory consuption is a serious limitation in that algorithm: The Similarity Graph representation usually requires a very large matrix with a high memory cost. This work proposes a new algorithm, based on a previous implementation named Genetic Graph-based Clustering (GGC), that improves the memory usage while maintaining the quality of the solution. The new algorithm, called Multi-Objective Genetic Graph-based Clustering (MOGGC), uses an evolutionary approach introducing a Multi-Objective Genetic Algorithm to manage a reduced version of the Similarity Graph. The experimental validation shows that MOGGC increases the memory efficiency, maintaining and improving the GGC results in the synthetic and real datasets used in the experiments. An experimental comparison with several classical clustering methods (EM, SC and K-means) has been included to show the efficiency of the proposed algorithm.
  • Keywords
    data analysis; data mining; expectation-maximisation algorithm; genetic algorithms; graph theory; learning (artificial intelligence); pattern clustering; EM clustering method; MOGGC algorithm; SC clustering method; centroid-based approach; continuity clustering method; data analysis; distance measure; expectation-maximisation; graph spectrum; k-means clustering method; memory optimization; multiobjective genetic graph-based clustering; similarity graph; spectral clustering algorithm; Algorithm design and analysis; Clustering algorithms; Genetic algorithms; Genetics; Measurement; Sociology; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation (CEC), 2013 IEEE Congress on
  • Conference_Location
    Cancun
  • Print_ISBN
    978-1-4799-0453-2
  • Electronic_ISBN
    978-1-4799-0452-5
  • Type

    conf

  • DOI
    10.1109/CEC.2013.6557958
  • Filename
    6557958