DocumentCode
2448956
Title
A large scale clustering scheme for kernel K-Means
Author
Zhang, Rong ; Rudnicky, Alexander I.
Author_Institution
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume
4
fYear
2002
fDate
2002
Firstpage
289
Abstract
Kernel functions can be viewed as a non-linear transformation that increases the separability of the input data by mapping them to a new high dimensional space. The incorporation of kernel functions enables the K-Means algorithm to explore the inherent data pattern in the new space. However, the previous applications of the kernel K-Means algorithm are confined to small corpora due to its expensive computation and storage cost. To overcome these obstacles, we propose a new clustering scheme which changes the clustering order from the sequence of samples to the sequence of kernels, and employs a disk-based strategy to control data. The new clustering scheme has been demonstrated to be very efficient for a large corpus by our experiments on handwritten digits recognition, in which more than 90% of the running time was saved.
Keywords
handwritten character recognition; learning (artificial intelligence); matrix algebra; pattern classification; pattern clustering; data pattern; disk-based strategy; handwritten digits recognition; high dimensional space; kernel K-Means; large scale clustering scheme; nonlinear transformation; separability; Clustering algorithms; Computational efficiency; Computer science; Euclidean distance; Kernel; Large-scale systems; Learning systems; Machine learning; Partitioning algorithms; Unsupervised learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2002. Proceedings. 16th International Conference on
ISSN
1051-4651
Print_ISBN
0-7695-1695-X
Type
conf
DOI
10.1109/ICPR.2002.1047453
Filename
1047453
Link To Document