Learning Gaussian Mixture Models With Entropy-Based Criteria

Author

Benavent, Antonio Peñalver ; Ruiz, Francisco Escolano ; Sáez, Juan Manuel

Author_Institution

Dept. de Estadistica, Mat. e Inf., Univ. Miguel Hernandez, Elche, Spain

Volume

20

Issue

11

fYear

2009

Firstpage

1756

Lastpage

1771

Abstract

In this paper, we address the problem of estimating the parameters of Gaussian mixture models. Although the expectation-maximization (EM) algorithm yields the maximum-likelihood (ML) solution, its sensitivity to the selection of the starting parameters is well-known and it may converge to the boundary of the parameter space. Furthermore, the resulting mixture depends on the number of selected components, but the optimal number of kernels may be unknown beforehand. We introduce the use of the entropy of the probability density function (pdf) associated to each kernel to measure the quality of a given mixture model with a fixed number of kernels. We propose two methods to approximate the entropy of each kernel and a modification of the classical EM algorithm in order to find the optimum number of components of the mixture. Moreover, we use two stopping criteria: a novel global mixture entropy-based criterion called Gaussianity deficiency (GD) and a minimum description length (MDL) principle-based one. Our algorithm, called entropy-based EM (EBEM), starts with a unique kernel and performs only splitting by selecting the worst kernel attending to GD. We have successfully tested it in probability density estimation, pattern classification, and color image segmentation. Experimental results improve the ones of other state-of-the-art model order selection methods.

Keywords

Gaussian processes; entropy; expectation-maximisation algorithm; image colour analysis; image segmentation; maximum likelihood estimation; pattern classification; EM algorithm; Gaussian mixture model learning; color image segmentation; entropy-based criteria; expectation-maximization algorithm; global mixture entropy-based criterion; maximum-likelihood solution; minimum description length principle; model order selection method; pattern classification; probability density function; Clustering; EM algorithm; entropy estimation; minimum description length (MDL) criterion; mixture models; model order selection; Algorithms; Artificial Intelligence; Data Interpretation, Statistical; Entropy; Models, Statistical; Neural Networks (Computer); Normal Distribution;

fLanguage

English

Journal_Title

Neural Networks, IEEE Transactions on

Publisher

ieee

ISSN

1045-9227

Type

jour

DOI

10.1109/TNN.2009.2030190

Filename

5247027