• DocumentCode
    3028653
  • Title

    Minimum message length hidden Markov modelling

  • Author

    Edgoose, Timothy ; Allison, Lloyd

  • Author_Institution
    Dept. of Comput. Sci., Monash Univ., Clayton, Vic., Australia
  • fYear
    1998
  • fDate
    30 Mar-1 Apr 1998
  • Firstpage
    169
  • Lastpage
    178
  • Abstract
    This paper describes a minimum message length (MML) approach to finding the most appropriate hidden Markov model (HMM) to describe a given sequence of observations. An MML estimate for the expected length of a two-part message stating a specific HMM and the observations given this model is presented along with an effective search strategy for finding the best number of states for the model. The information estimate enables two models with different numbers of states to be fairly compared which is necessary if the search of this complex model space is to avoid the worst locally optimal solutions. The general purpose MML classifier `Snob´ has been extended and the new program `tSnob´ is tested on `synthetic´ data and a large `real world´ dataset. The MML measure is found to be an improvement on the Bayesian information criteria (BIG) and the un-supervised search strategy
  • Keywords
    encoding; hidden Markov models; pattern classification; search problems; sequences; Bayesian information criteria; Snob; effective search strategy; encoding; general purpose MML classifier; hidden Markov modelling; information estimate; locally optimal solutions; minimum message length HMM; model space search; model states; observations sequence; real world dataset; synthetic data; tSnob; unsupervised search strategy; Bayesian methods; Computer science; Hidden Markov models; Length measurement; Markov processes; Mathematical model; Search methods; Speech recognition; State estimation; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Compression Conference, 1998. DCC '98. Proceedings
  • Conference_Location
    Snowbird, UT
  • ISSN
    1068-0314
  • Print_ISBN
    0-8186-8406-2
  • Type

    conf

  • DOI
    10.1109/DCC.1998.672145
  • Filename
    672145