Title :
Multiple principal component analyses and projective clustering
Author :
Kerdprasop, Nittaya ; Kerdprasop, Kittisak
Author_Institution :
Data Eng. & Knowledge Discovery Res. Unit, Suranaree Univ. of Technol., Thailand
Abstract :
Projective clustering is a clustering technique for high dimensional data with the inherent sparsity of the data points. To overcome the unreliable measure of similarity among data points in high dimensions, all data points are projected to a lower dimensional sub-space. Principal component analysis (PCA) is an efficient method to dimensionality reduction by projecting all points to a lower dimensional subspace so that the information loss is minimized. However, PCA does not handle well the situation that different clusters are formed in different subspaces. We propose a method of multiple principal component analysis for iteratively computing projective clusters. The objective function is designed to determine the subspace associated with each cluster. Some experiments have been carried out to show the effectiveness of the proposed method.
Keywords :
database management systems; pattern clustering; principal component analysis; PCA; high dimensional data sets; multiple principal component analyses; projective clustering; Algorithm design and analysis; Clustering algorithms; Clustering methods; Conferences; Data engineering; Knowledge engineering; Partitioning algorithms; Principal component analysis; Spatial databases; Statistical analysis;
Conference_Titel :
Database and Expert Systems Applications, 2005. Proceedings. Sixteenth International Workshop on
Print_ISBN :
0-7695-2424-9
DOI :
10.1109/DEXA.2005.140