DocumentCode :
3127794
Title :
Kernel-Based Clustering with Automatic Cluster Number Selection
Author :
Wang, Chang-Dong ; Lai, Jian-Huang ; Huang, Dong
Author_Institution :
Sch. of Inf. Sci. & Technol., Sun Yat-sen Univ., Guangzhou, China
fYear :
2011
fDate :
11 Dec. 2011
Firstpage :
293
Lastpage :
299
Abstract :
Kernel k-means is one of the best-known kernel-based clustering methods for discovering nonlinearly separable clusters. However, like its original counterpart k-means, kernel k-means has two inherent drawbacks: (1) it is easily trapped in degenerate local minima when the cluster prototypes are poorly initialized, and (2) the actual number of clusters has to be provided in advance. Although some algorithms have been proposed to handle the first problem, there is still a lack of methods for automatically estimating the number of clusters in kernel space. In this paper, inspired by the on-line learning framework and the rival penalization mechanism, we propose a novel kernel-based clustering method with automatic cluster number selection (KeCans for short). In KeCans, prototypes are represented by a prototype descriptor, which is a real-valued matrix with each row representing a prototype. At initialization, the prototype descriptor is allocated more rows than the actual number of clusters. Rival penalization is applied during the competition process to eliminate the redundant rows. Experimental results demonstrate the effectiveness of the proposed method in revealing the real number of clusters in kernel space. Compared with state-of-the-art kernel-based clustering algorithms, the proposed method achieves comparable clustering results.
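The mechanism summarized in the abstract can be illustrated with a minimal sketch of rival penalized competitive learning carried out in kernel space. This is not the KeCans algorithm itself: the abstract does not specify the layout of the prototype descriptor or the update rules, so the sketch assumes each prototype is a linear combination of mapped data points, stored as one row of a coefficient matrix `A`. The function names, the learning rates `lr` and `rival_lr`, and the RBF kernel are all illustrative assumptions.

```python
import numpy as np


def rbf_kernel(X, gamma=1.0):
    """Gram matrix K[i, j] = exp(-gamma * ||x_i - x_j||^2)."""
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    return np.exp(-gamma * np.maximum(d2, 0.0))


def sq_dists(K, A):
    """Squared kernel-space distance from every point to every prototype.

    Assumption: prototype k is w_k = sum_j A[k, j] * phi(x_j), so
    ||phi(x_i) - w_k||^2 = K[i, i] - 2 (A K)[k, i] + (A K A^T)[k, k].
    Returns an array of shape (n_points, n_prototypes).
    """
    AK = A @ K
    proto_norms = np.sum(AK * A, axis=1)
    return np.diag(K)[:, None] - 2.0 * AK.T + proto_norms[None, :]


def kernel_rpcl(K, n_prototypes, lr=0.05, rival_lr=0.005, epochs=20, seed=0):
    """On-line competitive learning with rival penalization (illustrative).

    Start with more prototypes than the expected number of clusters
    (n_prototypes must be at least 2). For each sample, the closest
    prototype (winner) moves toward it and the second closest (rival)
    is pushed away.
    """
    rng = np.random.default_rng(seed)
    n = K.shape[0]
    # Each prototype starts at the image of a distinct random data point.
    A = np.zeros((n_prototypes, n))
    start = rng.choice(n, size=n_prototypes, replace=False)
    A[np.arange(n_prototypes), start] = 1.0

    for _ in range(epochs):
        for i in rng.permutation(n):
            # Recomputing all distances is wasteful but keeps the sketch short.
            d = sq_dists(K, A)[i]
            winner, rival = np.argsort(d)[:2]
            # w_winner <- w_winner + lr * (phi(x_i) - w_winner)
            A[winner] *= 1.0 - lr
            A[winner, i] += lr
            # w_rival <- w_rival - rival_lr * (phi(x_i) - w_rival)
            A[rival] *= 1.0 + rival_lr
            A[rival, i] -= rival_lr

    labels = np.argmin(sq_dists(K, A), axis=1)
    return labels, A
```

Under these assumptions, prototypes that end up with no points assigned to them in `labels` play the role of the redundant rows. How KeCans itself detects and removes redundant rows is not stated in the abstract.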
Keywords :
learning (artificial intelligence); matrix algebra; pattern clustering; KeCans; automatic cluster number selection; kernel k-means; kernel space; kernel-based clustering methods; online learning framework; prototype descriptor; real-valued matrix; rival penalization; Arrays; Clustering algorithms; Clustering methods; Convergence; Indexes; Kernel; Prototypes; cluster number selection; data clustering; kernel-based clustering; on-line learning; rival penalization;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4673-0005-6
Type :
conf
DOI :
10.1109/ICDMW.2011.107
Filename :
6137393