Title :
On adaptive decision rules and decision parameter adaptation for automatic speech recognition
Author :
Lee, Chin-Hui ; Huo, Qiang
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
Abstract :
Recent advances in automatic speech recognition are accomplished by designing a plug-in maximum a posteriori decision rule such that the forms of the acoustic and language model distributions are specified and the parameters of the assumed distributions are estimated from a collection of speech and language training corpora. Maximum-likelihood point estimation is by far the most prevailing training method. However, due to the problems of unknown speech distributions, sparse training data, high spectral and temporal variabilities in speech, and possible mismatch between training and testing conditions, a dynamic training strategy is needed. To cope with the changing speakers and speaking conditions in real operational conditions for high-performance speech recognition, such paradigms incorporate a small amount of speaker and environment specific adaptation data into the training process. Bayesian adaptive learning is an optimal way to combine prior knowledge in an existing collection of general models with a new set of condition-specific adaptation data. In this paper, the mathematical framework for Bayesian adaptation of acoustic and language model parameters is first described. Maximum a posteriori point estimation is then developed for hidden Markov models and a number of useful parameters densities commonly used in automatic speech recognition and natural language processing.
Keywords :
Bayes methods; decision theory; hidden Markov models; learning (artificial intelligence); maximum likelihood estimation; natural languages; speech recognition; Bayes predictive classification rule; Bayes risk consistency; Bayesian adaptive learning; acoustic model; acoustic modeling; adaptive decision rule; adaptive decision rules; automatic speech recognition; conjugate density; decision parameter adaptation; dynamic training strategy; expectation maximization; hidden Markov models; incomplete data problem; language model; language modeling; mathematical framework; maximum a-posteriori decision rule; maximum a-posteriori point estimation; maximum mutual information; maximum-likelihood linear regression; maximum-likelihood point estimation; minimax classification rule; minimum classification error learning; minimum discrimination information; natural language processing; optimal Bayes decision rule; prior density; quasi-Bayes learning; recursive Bayesian learning; statistical decision theory; Automatic speech recognition; Bayesian methods; Hidden Markov models; Loudspeakers; Mathematical model; Maximum likelihood estimation; Natural languages; Speech recognition; Testing; Training data;
Journal_Title :
Proceedings of the IEEE