• DocumentCode
    888248
  • Title

    A similarity-based robust clustering method

  • Author

    Yang, Miin-Shen ; Wu, Kuo-Lung

  • Author_Institution
    Dept. of Appl. Math., Chung Yuan Christian Univ., Chung-li, Taiwan
  • Volume
    26
  • Issue
    4
  • fYear
    2004
  • fDate
    4/1/2004 12:00:00 AM
  • Firstpage
    434
  • Lastpage
    448
  • Abstract
    This paper presents an alternating optimization clustering procedure called a similarity-based clustering method (SCM). It is an effective and robust approach to clustering on the basis of a total similarity objective function related to the approximate density shape estimation. We show that the data points in SCM can self-organize local optimal cluster number and volumes without using cluster validity functions or a variance-covariance matrix. The proposed clustering method is also robust to noise and outliers based on the influence function and gross error sensitivity analysis. Therefore, SCM exhibits three robust clustering characteristics: 1) robust to the initialization (cluster number and initial guesses), 2) robust to cluster volumes (ability to detect different volumes of clusters), and 3) robust to noise and outliers. Several numerical data sets and actual data are used in the SCM to show these good aspects. The computational complexity of SCM is also analyzed. Some experimental results of comparing the proposed SCM with the existing methods show the superiority of the SCM method.
  • Keywords
    computational complexity; covariance matrices; estimation theory; optimisation; pattern clustering; sensitivity analysis; computational complexity; density shape estimation; error sensitivity analysis; optimization clustering procedure; self organized local optimal cluster; similarity based robust clustering; similarity objective function; variance covariance matrix; Clustering algorithms; Clustering methods; Data mining; Noise robustness; Noise shaping; Optimization methods; Pattern recognition; Prototypes; Sensitivity analysis; Shape; Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Fuzzy Logic; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Signal Processing, Computer-Assisted; Subtraction Technique;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2004.1265860
  • Filename
    1265860