DocumentCode
2457027
Title
Automatic Cluster Number Selection Using a Split and Merge K-Means Approach
Author
Muhr, Markus ; Granitzer, Michael
Author_Institution
Knowledge Relationship Discovery, Know-Center Graz, Graz, Austria
fYear
2009
fDate
Aug. 31 2009-Sept. 4 2009
Firstpage
363
Lastpage
367
Abstract
The k-means method is a simple and fast clustering technique that exhibits the problem of specifying the optimal number of clusters preliminarily. We address the problem of cluster number selection by using a k-means approach that exploits local changes of internal validity indices to split or merge clusters. Our split and merge k-means issues criterion functions to select clusters to be split or merged and fitness assessments on cluster structure changes. Experiments on standard test data sets show that this approach selects an accurate number of clusters with reasonable runtime and accuracy.
Keywords
pattern clustering; statistical analysis; automatic cluster number selection; internal validity indices; split and merge k-means approach; standard test data; Clustering algorithms; Clustering methods; Databases; Expert systems; Information retrieval; Knowledge management; Large-scale systems; Navigation; Runtime; Testing; cluster number selection; k-means; split and merge; validity indices;
fLanguage
English
Publisher
ieee
Conference_Titel
Database and Expert Systems Application, 2009. DEXA '09. 20th International Workshop on
Conference_Location
Linz
ISSN
1529-4188
Print_ISBN
978-0-7695-3763-4
Type
conf
DOI
10.1109/DEXA.2009.39
Filename
5337108
Link To Document