• DocumentCode
    327681
  • Title

    Mixture models for co-occurrence and histogram data

  • Author

    Hofmann, Thomas ; Puzicha, Jan

  • Author_Institution
    Artificial Intelligence Lab., MIT, Cambridge, MA, USA
  • Volume
    1
  • fYear
    1998
  • fDate
    16-20 Aug 1998
  • Firstpage
    192
  • Abstract
    Modeling and predicting co-occurrences of events is a fundamental problem of unsupervised learning. We develop a general statistical framework for analyzing co-occurrence data based on probabilistic clustering by mixture models. More specifically, we discuss three models which pursue different modeling goals and which differ in the way they define the probabilistic partitioning of the observations. Adopting the maximum likelihood principle, annealed EM algorithms are derived for parameter estimation. From the class of potential applications in pattern recognition and data analysis, we have chosen document retrieval, language modeling, and unsupervised texture segmentation to test and evaluate the proposed algorithms
  • Keywords
    image segmentation; maximum likelihood estimation; pattern clustering; probability; simulated annealing; unsupervised learning; annealed EM algorithms; co-occurrence; data analysis; document retrieval; general statistical framework; histogram data; language modeling; maximum likelihood principle; mixture models; probabilistic clustering; probabilistic partitioning; unsupervised learning; unsupervised texture segmentation; Annealing; Clustering algorithms; Data analysis; Histograms; Maximum likelihood estimation; Parameter estimation; Partitioning algorithms; Pattern recognition; Predictive models; Unsupervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 1998. Proceedings. Fourteenth International Conference on
  • Conference_Location
    Brisbane, Qld.
  • ISSN
    1051-4651
  • Print_ISBN
    0-8186-8512-3
  • Type

    conf

  • DOI
    10.1109/ICPR.1998.711113
  • Filename
    711113