Title :
Grouped data clustering using a fast mixture-model-based algorithm
Author_Institution :
Lab. des Technol. Nouvelles, Inst. Nat. de Rech. sur les Transp. et leur Securite, Noisy-le-Grand, France
Abstract :
Mixture-model-based clustering has become a popular approach in many data analysis problems for its statistical properties and the implementation simplicity of the EM algorithm. However the computation time of the EM algorithm and its variants increases significantly with the sample size. For large data sets, performing clustering on grouped data constitutes an efficient alternative to speed up the algorithms execution time. A rapid and effective algorithm dedicated to grouped data clustering is then proposed in this paper. Inspired by the Classification EM algorithm (CEM), the proposed approach estimates the missing sample at each iteration. An experimental study using simulated data and real acoustic emission data in the context of a flaw detection application on gas tanks reveals good performances of the proposed approach in terms of partitioning precision and computing time.
Keywords :
data analysis; expectation-maximisation algorithm; pattern clustering; statistical analysis; EM algorithm; data analysis; grouped data clustering; mixture-model-based algorithm; mixture-model-based clustering; statistical properties; Acoustic emission; Classification algorithms; Clustering algorithms; Cybernetics; Data analysis; Histograms; Maximum likelihood detection; Maximum likelihood estimation; Parameter estimation; Partitioning algorithms; Classification EM algorithm; Clustering; EM algorithm; grouped Data; histogram; mixture models;
Conference_Titel :
Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on
Conference_Location :
San Antonio, TX
Print_ISBN :
978-1-4244-2793-2
Electronic_ISBN :
1062-922X
DOI :
10.1109/ICSMC.2009.5345961