DocumentCode :
110144
Title :
Deterministic Feature Selection for K-Means Clustering
Author :
Boutsidis, Christos ; Magdon-Ismail, Malik
Author_Institution :
Christos Boutsidis Math. Sci. Dept., IBM T. J. Watson Res. Center, Ossining, NY, USA
Volume :
59
Issue :
9
fYear :
2013
fDate :
Sept. 2013
Firstpage :
6099
Lastpage :
6110
Abstract :
We study feature selection for k-means clustering. Although the literature contains many methods with good empirical performance, algorithms with provable theoretical behavior have only recently been developed. Unfortunately, these algorithms are randomized and fail with, say, a constant probability. We present the first deterministic feature selection algorithm for k-means clustering with relative error guarantees. At the heart of our algorithm lies a deterministic method for decompositions of the identity and a structural result which quantifies some of the tradeoffs in dimensionality reduction.
Keywords :
computational complexity; deterministic algorithms; feature extraction; pattern clustering; probability; unsupervised learning; constant probability; deterministic feature selection algorithm; deterministic method; dimensionality reduction; identity decomposition; k-means clustering; relative error guarantees; unsupervised learning; Approximation algorithms; Approximation methods; Clustering algorithms; Feature extraction; Linear programming; Partitioning algorithms; Vectors; Clustering methods; dimensionality reduction; unsupervised learning;
fLanguage :
English
Journal_Title :
Information Theory, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9448
Type :
jour
DOI :
10.1109/TIT.2013.2255021
Filename :
6488848
Link To Document :
بازگشت