Title :
Improving X-means clustering with MNDL
Author :
Shahbaba, Mahdi ; Beheshti, Soosan
Author_Institution :
Dept. of Electr. & Comput. Eng., Ryerson Univ., Toronto, ON, Canada
Abstract :
Estimating the true number of clusters for an unlabeled data set is one of the most important limitations in clustering. To solve this issue, many approaches with different assumptions have been proposed in the literature. X-means clustering is one of the proposed methods, which employs Bayesian Information Criterion (BIC) to approximate the correct number of clusters. In this paper, we propose the use of Minimum Noiseless Description Length (MNDL) as a cluster splitting criterion for X-means clustering. MNDL is able to find the optimum splitting criterion for X-means clustering. Simulation results demonstrate that MNDL splitting criterion has the same computational complexity as BIC but, predicts the true number of clusters more often.
Keywords :
Bayes methods; pattern clustering; BIC; Bayesian information criterion; MNDL; X-means clustering; cluster splitting criterion; minimum noiseless description length; unlabeled data set; Abstracts; Bayesian methods; Equations; Ink; Noise; BIC; Clustering; MNDL; X-means;
Conference_Titel :
Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
Conference_Location :
Montreal, QC
Print_ISBN :
978-1-4673-0381-1
Electronic_ISBN :
978-1-4673-0380-4
DOI :
10.1109/ISSPA.2012.6310493