Title :
Mixture Model Clustering of Uncertain Data
Author :
Hamdan, Hani ; Govaert, Gêrard
Author_Institution :
CETIM, Senlis
Abstract :
This paper addresses the problem of fitting mixture densities to uncertain data using the EM algorithm. Uncertain data are modelled by multivariate uncertainty zones which constitute a generalization of multivariate interval-valued data. We develop an EM algorithm to treat uncertainty zones around points of Ropfp in order to estimate the parameters of a mixture model defined on Ropfp and obtain a fuzzy clustering or partition. This EM algorithm requires the evaluation of multidimensional integrals over each uncertainty zone at each iteration. In the diagonal Gaussian mixture model case, these integrals can be computed by simply using the one-dimensional normal cumulative distribution function. Results on simulated data indicate that the proposed algorithm can estimate the true underlying density better than the classical EM algorithm applied to the imprecise data, especially when the imprecision degree is high
Keywords :
Gaussian processes; fuzzy set theory; pattern clustering; statistical distributions; uncertainty handling; 1D normal cumulative distribution function; diagonal Gaussian mixture model; fuzzy clustering; fuzzy partition; mixture density fitting; mixture model clustering; multidimensional integrals; multivariate interval-valued data generalization; multivariate uncertainty zones; uncertain data clustering; Clustering algorithms; Distributed computing; Distribution functions; Heuristic algorithms; Iterative algorithms; Multidimensional systems; Parameter estimation; Partitioning algorithms; Prototypes; Uncertainty;
Conference_Titel :
Fuzzy Systems, 2005. FUZZ '05. The 14th IEEE International Conference on
Conference_Location :
Reno, NV
Print_ISBN :
0-7803-9159-4
DOI :
10.1109/FUZZY.2005.1452510