DocumentCode
3060075
Title
A simultaneous two-level clustering algorithm for automatic model selection
Author
Cabanes, Guénaël ; Bennani, Younès
Author_Institution
Univ. of Paris 13, Villetaneuse
fYear
2007
fDate
13-15 Dec. 2007
Firstpage
316
Lastpage
321
Abstract
One of the most crucial questions in many real-world cluster applications is determining a suitable number of clusters, also known as the model selection problem. Determining the optimum number of clusters is an ill posed problem for which there is no simple way of knowing that number without a priori knowledge. In this paper we propose a new two-level clustering algorithm based on self organizing map, called S2L-SOM, which allows an automatic determination of the number of clusters during learning. Estimating true numbers of clusters is related to the cluster stability which involved the validity of clusters generated by the learning algorithm. To measure this stability we use the sub-sampling method. The great advantage of our proposed algorithm, compared to the common partitional clustering methods, is that it is not restricted to convex clusters but can recognize arbitrarily shaped clusters. The validity of this algorithm is superior to standard two-level clustering methods such as SOM+k-means and SOM+Hierarchical agglomerative clustering. This is demonstrated on a set of critical clustering problems.
Keywords
learning (artificial intelligence); pattern clustering; sampling methods; self-organising feature maps; S2L-SOM; arbitrarily shaped clusters; automatic cluster determination; automatic model selection; cluster stability; convex clusters; learning algorithm; model selection problem; partitional clustering methods; self organizing map; simultaneous two-level clustering algorithm; sub-sampling method; Clustering algorithms; Clustering methods; Data visualization; Machine learning; Machine learning algorithms; Organizing; Partitioning algorithms; Stability; Testing; Unsupervised learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Applications, 2007. ICMLA 2007. Sixth International Conference on
Conference_Location
Cincinnati, OH
Print_ISBN
978-0-7695-3069-7
Type
conf
DOI
10.1109/ICMLA.2007.71
Filename
4457250
Link To Document