DocumentCode
573185
Title
Improving X-means clustering with MNDL
Author
Shahbaba, Mahdi ; Beheshti, Soosan
Author_Institution
Dept. of Electr. & Comput. Eng., Ryerson Univ., Toronto, ON, Canada
fYear
2012
fDate
2-5 July 2012
Firstpage
1298
Lastpage
1302
Abstract
Estimating the true number of clusters for an unlabeled data set is one of the most important limitations in clustering. To solve this issue, many approaches with different assumptions have been proposed in the literature. X-means clustering is one of the proposed methods, which employs Bayesian Information Criterion (BIC) to approximate the correct number of clusters. In this paper, we propose the use of Minimum Noiseless Description Length (MNDL) as a cluster splitting criterion for X-means clustering. MNDL is able to find the optimum splitting criterion for X-means clustering. Simulation results demonstrate that MNDL splitting criterion has the same computational complexity as BIC but, predicts the true number of clusters more often.
Keywords
Bayes methods; pattern clustering; BIC; Bayesian information criterion; MNDL; X-means clustering; cluster splitting criterion; minimum noiseless description length; unlabeled data set; Abstracts; Bayesian methods; Equations; Ink; Noise; BIC; Clustering; MNDL; X-means;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
Conference_Location
Montreal, QC
Print_ISBN
978-1-4673-0381-1
Electronic_ISBN
978-1-4673-0380-4
Type
conf
DOI
10.1109/ISSPA.2012.6310493
Filename
6310493
Link To Document