Title :
Initialization of cluster refinement algorithms: a review and comparative study
Author :
He, Ji ; Lan, Man ; Tan, Chew-Lim ; Sung, Sam-Yuan ; Low, Hwee-Boon
Author_Institution :
Sch. of Comput., Nat. Univ. of Singapore, Singapore
Abstract :
Various iterative refinement clustering methods are dependent on the initial state of the model and are capable of obtaining one of their local optima only. Since the task of identifying the global optimization is NP-hard, the study of the initialization method towards a sub-optimization is of great value. This paper reviews the various cluster initialization methods in the literature by categorizing them into three major families, namely random sampling methods, distance optimization methods, and density estimation methods. In addition, using a set of quantitative measures, we assess their performance on a number of synthetic and real-life data sets. Our controlled benchmark identifies two distance optimization methods, namely SCS and KKZ, as complements of the k-means learning characteristics towards a better cluster separation in the output solution.
Keywords :
data mining; learning (artificial intelligence); optimisation; pattern clustering; cluster initialization methods; cluster refinement algorithm; density estimation methods; distance optimization methods; k-means learning characteristics; random sampling methods; Clustering algorithms; Clustering methods; Data mining; Image segmentation; Iterative algorithms; Iterative methods; Optimization methods; Sampling methods; Sun; Unsupervised learning;
Conference_Titel :
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
Print_ISBN :
0-7803-8359-1
DOI :
10.1109/IJCNN.2004.1379917