DocumentCode
312134
Title
Bayesian adaptation of speech recognizers to field speech data
Author
Miglietta, C.G. ; Mokbel, Chajic ; Jouvet, Denis ; Monne, J.
Author_Institution
CNET, Lannion, France
Volume
2
fYear
1996
fDate
3-6 Oct 1996
Firstpage
917
Abstract
The article studies a Bayesian (or Maximum A Posteriori MAP) approach to the adaptation of continuous density hidden Markov models (CDHMMs) to a specific condition of a speech recognition application. In order to improve the model robustness, CDHMMs formerly trained from laboratory data are then adapted using context dependent field utterances. Two specific problems have to be faced when using the MAP approach: the estimation of the a priori distribution parameters and the lack of field adaptation data for some distributions of the CDHMM. To estimate the a priori distribution parameters, we need to identify different realizations of the model parameters. Three different solutions are proposed and evaluated. To overcome the lack of adaptation data, field acoustical training frames may be shared among similar distributions. This is performed using an acoustical tree, obtained by progressively clustering the model distributions. Recognition results show that MAP adapted models significantly outperform those trained by maximum likelihood (ML), specifically when the field data set is small
Keywords
Bayes methods; acoustic signal processing; hidden Markov models; maximum likelihood estimation; speech recognition; Bayesian adaptation; CDHMMs; MAP adapted models; MAP approach; Maximum A Posteriori MAP; a priori distribution parameters; acoustical tree; context dependent field utterances; continuous density hidden Markov models; field acoustical training frames; field adaptation data; field data set; field speech data; maximum likelihood; model distributions; model parameters; model robustness; speech recognition application; speech recognizers; Adaptation model; Bayesian methods; Covariance matrix; Databases; Equations; Iterative algorithms; Laboratories; Random variables; Robustness; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.607751
Filename
607751
Link To Document