Title :
An Ellipsoidal K-Means for Document Clustering
Author :
Dzogang, F. ; Marsala, Christophe ; Lesot, M. ; Rifqi, Maria
Author_Institution :
LIP6, Univ. Pierre et Marie Curie - Paris 6, Paris, France
Abstract :
We propose an extension of the spherical K-means algorithm to deal with settings where the number of data points is largely inferior to the number of dimensions. We assume the data to lie in local and dense regions of the original space and we propose to embed each cluster into its specific ellipsoid. A new objective function is introduced, analytical solutions are derived for both the centroids and the associated ellipsoids. Furthermore, a study on the complexity of this algorithm highlights that it is of same order as the regular K-means algorithm. Results on both synthetic and real data show the efficiency of the proposed method.
Keywords :
computational complexity; document handling; pattern clustering; algorithm complexity; document clustering; ellipsoid; ellipsoidal k-means; spherical k-means algorithm; Clustering algorithms; Ellipsoids; Feature extraction; Linear programming; Partitioning algorithms; Tuning; Vectors; clustering; feature selection; information retrieval; spherical k-means;
Conference_Titel :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
Print_ISBN :
978-1-4673-4649-8
DOI :
10.1109/ICDM.2012.126