DocumentCode :
2985317
Title :
An Ellipsoidal K-Means for Document Clustering
Author :
Dzogang, F. ; Marsala, Christophe ; Lesot, M. ; Rifqi, Maria
Author_Institution :
LIP6, Univ. Pierre et Marie Curie - Paris 6, Paris, France
fYear :
2012
fDate :
10-13 Dec. 2012
Firstpage :
221
Lastpage :
230
Abstract :
We propose an extension of the spherical K-means algorithm to deal with settings where the number of data points is largely inferior to the number of dimensions. We assume the data to lie in local and dense regions of the original space and we propose to embed each cluster into its specific ellipsoid. A new objective function is introduced, analytical solutions are derived for both the centroids and the associated ellipsoids. Furthermore, a study on the complexity of this algorithm highlights that it is of same order as the regular K-means algorithm. Results on both synthetic and real data show the efficiency of the proposed method.
Keywords :
computational complexity; document handling; pattern clustering; algorithm complexity; document clustering; ellipsoid; ellipsoidal k-means; spherical k-means algorithm; Clustering algorithms; Ellipsoids; Feature extraction; Linear programming; Partitioning algorithms; Tuning; Vectors; clustering; feature selection; information retrieval; spherical k-means;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
ISSN :
1550-4786
Print_ISBN :
978-1-4673-4649-8
Type :
conf
DOI :
10.1109/ICDM.2012.126
Filename :
6413900
Link To Document :
بازگشت