Title :
Outlier-aware robust clustering
Author :
Forero, Pedro A. ; Kekatos, Vassilis ; Giannakis, Georgios B.
Author_Institution :
Dept. of Electr. & Comput. Engrg., Univ. of Minnesota, Minneapolis, MN, USA
Abstract :
Clustering is a basic task in a variety of machine learning applications. Partitioning a set of input vectors into compact, well separated subsets can be severely affected by the presence of model incompatible inputs called outliers. The present paper develops robust clustering algorithms for jointly partitioning the data and identifying the outliers. The novel approach relies on translating scarcity of outliers to sparsity in a judiciously defined domain, to robustify three widely used clustering schemes: hard K-means, fuzzy K-means, and probabilistic clustering. Cluster centers and assignments are iteratively updated in closed form. The developed outlier aware algorithms are guaranteed to converge, while their computational complexity is of the same order as their outlier-agnostic counterparts. Preliminary simulations validate the analytical claims.
Keywords :
learning (artificial intelligence); pattern clustering; set theory; data partitioning; fuzzy K-means clustering; hard K-means clustering; machine learning; outlier identification; outlier-aware robust clustering; probabilistic clustering; subsets; Clustering algorithms; Clustering methods; Data models; Optimization; Probabilistic logic; Robustness; Signal processing algorithms; K-means; block coordinate descent; convex relaxation; expectation maximization; robust clustering;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5946928