Mixture models for co-occurrence and histogram data

Author

Hofmann, Thomas ; Puzicha, Jan

Author_Institution

Artificial Intelligence Lab., MIT, Cambridge, MA, USA

Volume

1

fYear

1998

fDate

16-20 Aug 1998

Firstpage

192

Abstract

Modeling and predicting co-occurrences of events is a fundamental problem of unsupervised learning. We develop a general statistical framework for analyzing co-occurrence data based on probabilistic clustering by mixture models. More specifically, we discuss three models which pursue different modeling goals and which differ in the way they define the probabilistic partitioning of the observations. Adopting the maximum likelihood principle, annealed EM algorithms are derived for parameter estimation. From the class of potential applications in pattern recognition and data analysis, we have chosen document retrieval, language modeling, and unsupervised texture segmentation to test and evaluate the proposed algorithms

Keywords

image segmentation; maximum likelihood estimation; pattern clustering; probability; simulated annealing; unsupervised learning; annealed EM algorithms; co-occurrence; data analysis; document retrieval; general statistical framework; histogram data; language modeling; maximum likelihood principle; mixture models; probabilistic clustering; probabilistic partitioning; unsupervised learning; unsupervised texture segmentation; Annealing; Clustering algorithms; Data analysis; Histograms; Maximum likelihood estimation; Parameter estimation; Partitioning algorithms; Pattern recognition; Predictive models; Unsupervised learning;

fLanguage

English

Publisher

ieee

Conference_Titel

Pattern Recognition, 1998. Proceedings. Fourteenth International Conference on

Conference_Location

Brisbane, Qld.

ISSN

1051-4651

Print_ISBN

0-8186-8512-3

Type

conf

DOI

10.1109/ICPR.1998.711113

Filename

711113