DocumentCode
2179063
Title
Discriminative training for Bayesian sensing hidden Markov models
Author
Saon, George ; Chien, Jen-Tzung
Author_Institution
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
fYear
2011
fDate
22-27 May 2011
Firstpage
5316
Lastpage
5319
Abstract
We describe feature space and model space discriminative training for a new class of acoustic models called Bayesian sensing hidden Markov models (BS-HMMs). In BS-HMMs, speech data is represented by a set of state-dependent basis vectors. The relevance of a feature vector to different bases is determined by the precision matrices of the sensing weights. The basis vectors and the precision matrices of the reconstruction errors are jointly estimated by optimizing a maximum mutual information (MMI) criterion. Additionally, we discuss the training of an fMPE-style discriminative feature transformation under the same criterion given these models. Experimental results on an LVCSR task show that the proposed models outperform discriminatively trained conventional HMMs with Gaussian mixture models (GMMs). Cross-adapting the baseline GMM-HMMs to the BS-HMM output yields a 6% relative gain which indicates that the two systems make different errors.
Keywords
belief networks; hidden Markov models; matrix algebra; speech processing; BS-HMM; Bayesian sensing hidden Markov models; LVCSR; MMI criterion; discriminative training; fMPE-style discriminative feature transformation; matrices; maximum mutual information criterion; speech data; state-dependent basis vectors; Acoustics; Bayesian methods; Hidden Markov models; Sensors; Smoothing methods; Training; Transforms; Bayesian learning; basis representation; discriminative training;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947558
Filename
5947558
Link To Document