• DocumentCode
    52394
  • Title

    Robust Model-Based Learning via Spatial-EM Algorithm

  • Author

    Kai Yu ; Xin Dang ; Bart, Henry ; Yixin Chen

  • Author_Institution
    Amazon Web Service, Seattle, WA, USA
  • Volume
    27
  • Issue
    6
  • fYear
    2015
  • fDate
    June 1 2015
  • Firstpage
    1670
  • Lastpage
    1682
  • Abstract
    This paper presents a new robust EM algorithm for the finite mixture learning procedures. The proposed Spatial-EM algorithm utilizes median-based location and rank-based scatter estimators to replace sample mean and sample covariance matrix in each M step, hence enhancing stability and robustness of the algorithm. It is robust to outliers and initial values. Compared with many robust mixture learning methods, the Spatial-EM has the advantages of simplicity in implementation and statistical efficiency. We apply Spatial-EM to supervised and unsupervised learning scenarios. More specifically, robust clustering and outlier detection methods based on Spatial-EM have been proposed. We apply the outlier detection to taxonomic research on fish species novelty discovery. Two real datasets are used for clustering analysis. Compared with the regular EM and many other existing methods such as K-median, X-EM and SVM, our method demonstrates superior performance and high robustness.
  • Keywords
    covariance matrices; estimation theory; expectation-maximisation algorithm; pattern clustering; support vector machines; unsupervised learning; K-median; SVM; X-EM; clustering analysis; finite mixture learning procedure; fish species novelty discovery; median-based location; mixture learning method; outlier detection method; rank-based scatter estimator; robust clustering; robust model-based learning; sample covariance matrix; sample mean; spatial-EM algorithm; statistical efficiency; taxonomic research; unsupervised learning; Biological system modeling; Clustering algorithms; Covariance matrices; Data models; Electric breakdown; Maximum likelihood estimation; Robustness; Clustering; EM algorithm; finite mixture; outlier detection; robustness; spatial rank;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2014.2373355
  • Filename
    6964786