Title :
Mixture models for co-occurrence and histogram data
Author :
Hofmann, Thomas ; Puzicha, Jan
Author_Institution :
Artificial Intelligence Lab., MIT, Cambridge, MA, USA
Abstract :
Modeling and predicting co-occurrences of events is a fundamental problem of unsupervised learning. We develop a general statistical framework for analyzing co-occurrence data based on probabilistic clustering by mixture models. More specifically, we discuss three models which pursue different modeling goals and which differ in the way they define the probabilistic partitioning of the observations. Adopting the maximum likelihood principle, annealed EM algorithms are derived for parameter estimation. From the class of potential applications in pattern recognition and data analysis, we have chosen document retrieval, language modeling, and unsupervised texture segmentation to test and evaluate the proposed algorithms
Keywords :
image segmentation; maximum likelihood estimation; pattern clustering; probability; simulated annealing; unsupervised learning; annealed EM algorithms; co-occurrence; data analysis; document retrieval; general statistical framework; histogram data; language modeling; maximum likelihood principle; mixture models; probabilistic clustering; probabilistic partitioning; unsupervised learning; unsupervised texture segmentation; Annealing; Clustering algorithms; Data analysis; Histograms; Maximum likelihood estimation; Parameter estimation; Partitioning algorithms; Pattern recognition; Predictive models; Unsupervised learning;
Conference_Titel :
Pattern Recognition, 1998. Proceedings. Fourteenth International Conference on
Conference_Location :
Brisbane, Qld.
Print_ISBN :
0-8186-8512-3
DOI :
10.1109/ICPR.1998.711113