Title :
K-means Clustering Algorithm in Projected Spaces
Author :
Nasser, A. ; Hamad, Denis
Author_Institution :
L.A.S.L-U.L.C.O, Calais
Abstract :
Clustering has been known as a popular technique for pattern recognition, image processing, and data mining. Unfortunately, all known clustering algorithms tend to break down in high dimensional spaces; this is due to the inherent sparsity of the points. We investigate, in this paper, the use of linear and nonlinear principal manifolds for learning low-dimensional representations for clustering. Several leading methods: PCA, KPCA, Sammon, and CCA are examined and tested in clustering experiments using synthetic and real datasets from the UCI databases. We compare the clustering performance of the K-means algorithm on data projected by these projection methods. The experimental results show that K-means clustering on data projected by KPCA outperforms those projected by the three other methods
Keywords :
learning (artificial intelligence); pattern clustering; principal component analysis; K-means clustering algorithm; UCI database; data mining; image processing; linear principal manifold; low-dimensional representations learning; nonlinear principal manifold; pattern recognition; projected space; real dataset; synthetic dataset; Clustering algorithms; Data mining; Feature extraction; Image processing; Partitioning algorithms; Pattern recognition; Principal component analysis; Spatial databases; Testing; Virtual manufacturing; K-means; clustering; clustering accuracy; projection methods;
Conference_Titel :
Information Fusion, 2006 9th International Conference on
Conference_Location :
Florence
Print_ISBN :
1-4244-0953-5
Electronic_ISBN :
0-9721844-6-5
DOI :
10.1109/ICIF.2006.301737