Title :
Learning Gaussian Mixture Models With Entropy-Based Criteria
Author :
Benavent, Antonio Peñalver ; Ruiz, Francisco Escolano ; Sáez, Juan Manuel
Author_Institution :
Dept. de Estadistica, Mat. e Inf., Univ. Miguel Hernandez, Elche, Spain
Abstract :
In this paper, we address the problem of estimating the parameters of Gaussian mixture models. Although the expectation-maximization (EM) algorithm yields the maximum-likelihood (ML) solution, its sensitivity to the selection of the starting parameters is well-known and it may converge to the boundary of the parameter space. Furthermore, the resulting mixture depends on the number of selected components, but the optimal number of kernels may be unknown beforehand. We introduce the use of the entropy of the probability density function (pdf) associated to each kernel to measure the quality of a given mixture model with a fixed number of kernels. We propose two methods to approximate the entropy of each kernel and a modification of the classical EM algorithm in order to find the optimum number of components of the mixture. Moreover, we use two stopping criteria: a novel global mixture entropy-based criterion called Gaussianity deficiency (GD) and a minimum description length (MDL) principle-based one. Our algorithm, called entropy-based EM (EBEM), starts with a unique kernel and performs only splitting by selecting the worst kernel attending to GD. We have successfully tested it in probability density estimation, pattern classification, and color image segmentation. Experimental results improve the ones of other state-of-the-art model order selection methods.
Keywords :
Gaussian processes; entropy; expectation-maximisation algorithm; image colour analysis; image segmentation; maximum likelihood estimation; pattern classification; EM algorithm; Gaussian mixture model learning; color image segmentation; entropy-based criteria; expectation-maximization algorithm; global mixture entropy-based criterion; maximum-likelihood solution; minimum description length principle; model order selection method; pattern classification; probability density function; Clustering; EM algorithm; entropy estimation; minimum description length (MDL) criterion; mixture models; model order selection; Algorithms; Artificial Intelligence; Data Interpretation, Statistical; Entropy; Models, Statistical; Neural Networks (Computer); Normal Distribution;
Journal_Title :
Neural Networks, IEEE Transactions on
DOI :
10.1109/TNN.2009.2030190