• DocumentCode
    678020
  • Title

    Model Selection with BIC and ICL Criteria for Binned Data Clustering by Bin-EM-CEM Algorithms

  • Author

    Hamdan, Hani ; Jingwen Wu

  • Author_Institution
    Dept. of Signal Process. & Electron. Syst, SUPELEC, Gif-sur-Yvette, France
  • fYear
    2013
  • fDate
    13-16 Oct. 2013
  • Firstpage
    3133
  • Lastpage
    3138
  • Abstract
    Several clustering approaches are adapted to binned data in order to accelerate the clustering process or to deal with data of limited precision. Bin-EM-CEM algorithms of fourteen parsimonious Gaussian mixture models are developed. Each model performs differently according to its specific feature. Without knowing any information of the data, a criterion is considered to select the best model in order to obtain a good result. In this article, BIC and ICL criteria are adapted to binned data clustering to choose the bin-EM-CEM algorithm of the right model as well as the number of clusters. By different experiments on simulated data and real data, the performance of BIC and ICL criteria in model selection for binned data clustering are studied and compared on different aspects.
  • Keywords
    Gaussian processes; expectation-maximisation algorithm; pattern clustering; BIC criteria; Bayesian information criterion; Bin-EM-CEM algorithms; ICL criteria; binned data clustering; clustering approach; clustering process; expectation maximisation; integrated completed likelihood; model selection; parsimonious Gaussian mixture models; Adaptation models; Cities and towns; Clustering algorithms; Computational modeling; Data models; Standards; Statistics; BIC; ICL; bin-EM-CEM; clustering; model selection; parsimonious Gaussian mixture models;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics (SMC), 2013 IEEE International Conference on
  • Conference_Location
    Manchester
  • Type

    conf

  • DOI
    10.1109/SMC.2013.534
  • Filename
    6722287