Title :
A Multi-Resolution Hidden Markov Model Using Class-Specific Features
Author :
Baggenstoss, Paul M.
Author_Institution :
Naval Undersea Warfare Center, Newport, RI, USA
Abstract :
We apply the PDF projection theorem to generalize the hidden Markov model (HMM) to accommodate multiple simultaneous segmentations of the raw data and multiple feature extraction transformations. Different segment sizes and feature transformations are assigned to each state. The algorithm averages over all allowable segmentations by mapping the segmentations to a “proxy” HMM and using the forward procedure. A by-product of the algorithm is the set of a posteriori state probability estimates that serve as a description of the input data. These probabilities have simultaneously the temporal resolution of the smallest processing windows and the processing gain and frequency resolution of the largest processing windows. The method is demonstrated on the problem of precisely modeling the consonant “T” in order to detect the presence of a distinct “burst” component. We compare the algorithm against standard speech analysis methods using data from the TIMIT corpus.
Keywords :
hidden Markov models; probability; speech processing; PDF projection theorem; a posteriori state probability estimates; class-specific features; feature transformation; frequency resolution; multiple feature extraction; multiple simultaneous segmentation; multiresolution hidden Markov model; processing gain; temporal resolution; Automatic speech recognition; Cepstral analysis; Digital signal processing; Frequency; Hidden Markov models; Permission; Signal processing; Speech analysis; Speech processing; Wavelet packets; Markov processes; speech processing; time series analysis;
Journal_Title :
Signal Processing, IEEE Transactions on
DOI :
10.1109/TSP.2010.2052458