DocumentCode
3028653
Title
Minimum message length hidden Markov modelling
Author
Edgoose, Timothy ; Allison, Lloyd
Author_Institution
Dept. of Comput. Sci., Monash Univ., Clayton, Vic., Australia
fYear
1998
fDate
30 Mar-1 Apr 1998
Firstpage
169
Lastpage
178
Abstract
This paper describes a minimum message length (MML) approach to finding the most appropriate hidden Markov model (HMM) to describe a given sequence of observations. An MML estimate for the expected length of a two-part message stating a specific HMM and the observations given this model is presented along with an effective search strategy for finding the best number of states for the model. The information estimate enables two models with different numbers of states to be fairly compared which is necessary if the search of this complex model space is to avoid the worst locally optimal solutions. The general purpose MML classifier `Snob´ has been extended and the new program `tSnob´ is tested on `synthetic´ data and a large `real world´ dataset. The MML measure is found to be an improvement on the Bayesian information criteria (BIG) and the un-supervised search strategy
Keywords
encoding; hidden Markov models; pattern classification; search problems; sequences; Bayesian information criteria; Snob; effective search strategy; encoding; general purpose MML classifier; hidden Markov modelling; information estimate; locally optimal solutions; minimum message length HMM; model space search; model states; observations sequence; real world dataset; synthetic data; tSnob; unsupervised search strategy; Bayesian methods; Computer science; Hidden Markov models; Length measurement; Markov processes; Mathematical model; Search methods; Speech recognition; State estimation; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Compression Conference, 1998. DCC '98. Proceedings
Conference_Location
Snowbird, UT
ISSN
1068-0314
Print_ISBN
0-8186-8406-2
Type
conf
DOI
10.1109/DCC.1998.672145
Filename
672145
Link To Document