DocumentCode
888248
Title
A similarity-based robust clustering method
Author
Yang, Miin-Shen ; Wu, Kuo-Lung
Author_Institution
Dept. of Appl. Math., Chung Yuan Christian Univ., Chung-li, Taiwan
Volume
26
Issue
4
fYear
2004
fDate
4/1/2004 12:00:00 AM
Firstpage
434
Lastpage
448
Abstract
This paper presents an alternating optimization clustering procedure called a similarity-based clustering method (SCM). It is an effective and robust approach to clustering on the basis of a total similarity objective function related to the approximate density shape estimation. We show that the data points in SCM can self-organize local optimal cluster number and volumes without using cluster validity functions or a variance-covariance matrix. The proposed clustering method is also robust to noise and outliers based on the influence function and gross error sensitivity analysis. Therefore, SCM exhibits three robust clustering characteristics: 1) robust to the initialization (cluster number and initial guesses), 2) robust to cluster volumes (ability to detect different volumes of clusters), and 3) robust to noise and outliers. Several numerical data sets and actual data are used in the SCM to show these good aspects. The computational complexity of SCM is also analyzed. Some experimental results of comparing the proposed SCM with the existing methods show the superiority of the SCM method.
Keywords
computational complexity; covariance matrices; estimation theory; optimisation; pattern clustering; sensitivity analysis; computational complexity; density shape estimation; error sensitivity analysis; optimization clustering procedure; self organized local optimal cluster; similarity based robust clustering; similarity objective function; variance covariance matrix; Clustering algorithms; Clustering methods; Data mining; Noise robustness; Noise shaping; Optimization methods; Pattern recognition; Prototypes; Sensitivity analysis; Shape; Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Fuzzy Logic; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Signal Processing, Computer-Assisted; Subtraction Technique;
fLanguage
English
Journal_Title
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher
ieee
ISSN
0162-8828
Type
jour
DOI
10.1109/TPAMI.2004.1265860
Filename
1265860
Link To Document