• DocumentCode
    2457027
  • Title

    Automatic Cluster Number Selection Using a Split and Merge K-Means Approach

  • Author

    Muhr, Markus ; Granitzer, Michael

  • Author_Institution
    Knowledge Relationship Discovery, Know-Center Graz, Graz, Austria
  • fYear
    2009
  • fDate
    Aug. 31 2009-Sept. 4 2009
  • Firstpage
    363
  • Lastpage
    367
  • Abstract
    The k-means method is a simple and fast clustering technique that exhibits the problem of specifying the optimal number of clusters preliminarily. We address the problem of cluster number selection by using a k-means approach that exploits local changes of internal validity indices to split or merge clusters. Our split and merge k-means issues criterion functions to select clusters to be split or merged and fitness assessments on cluster structure changes. Experiments on standard test data sets show that this approach selects an accurate number of clusters with reasonable runtime and accuracy.
  • Keywords
    pattern clustering; statistical analysis; automatic cluster number selection; internal validity indices; split and merge k-means approach; standard test data; Clustering algorithms; Clustering methods; Databases; Expert systems; Information retrieval; Knowledge management; Large-scale systems; Navigation; Runtime; Testing; cluster number selection; k-means; split and merge; validity indices;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database and Expert Systems Application, 2009. DEXA '09. 20th International Workshop on
  • Conference_Location
    Linz
  • ISSN
    1529-4188
  • Print_ISBN
    978-0-7695-3763-4
  • Type

    conf

  • DOI
    10.1109/DEXA.2009.39
  • Filename
    5337108