Title :
Landscape of clustering algorithms
Author :
Jain, Anil K. ; Topchy, Alexander ; Law, Martin H C ; Buhmann, Joachim M.
Author_Institution :
Dept. of Comput. Sci. & Eng., Michigan State Univ., East Lansing, MI, USA
Abstract :
Numerous clustering algorithms, their taxonomies and evaluation studies are available in the literature. Despite the diversity of different clustering algorithms, solutions delivered by these algorithms exhibit many commonalities. An analysis of the similarity and properties of clustering objective functions is necessary from the operational/user perspective. We revisit conventional categorization of clustering algorithms and attempt to relate them according to the partitions they produce. We empirically study the similarity of clustering solutions obtained by many traditional as well as relatively recent clustering algorithms on a number of real-world data sets. Sammon´s mapping and a complete-link clustering of the inter-clustering dissimilarity values are performed to detect a meaningful grouping of the objective functions. We find that only a small number of clustering algorithms are sufficient to represent a large spectrum of clustering criteria. For example, interesting groups of clustering algorithms are centered around the graph partitioning, linkage-based and Gaussian mixture model based algorithms.
Keywords :
pattern clustering; statistical analysis; Gaussian mixture model algorithm; Sammon mapping; clustering algorithms; clustering criteria; clustering objective functions; graph partitioning; interclustering dissimilarity values; landscape; linkage algorithm; Algorithm design and analysis; Clustering algorithms; Computer science; Cost function; Data analysis; Data structures; Guidelines; Maximum likelihood detection; Partitioning algorithms; Taxonomy;
Conference_Titel :
Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on
Print_ISBN :
0-7695-2128-2
DOI :
10.1109/ICPR.2004.1334073