Title :
On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive Bayes estimate
Author :
Huo, Qiang ; Lee, Chin-Hui
Author_Institution :
ATR Interpreting Telephony Res. Labs., Kyoto, Japan
fDate :
3/1/1997 12:00:00 AM
Abstract :
We present a framework of quasi-Bayes (QB) learning of the parameters of the continuous density hidden Markov model (CDHMM) with Gaussian mixture state observation densities. The QB formulation is based on the theory of recursive Bayesian inference. The QB algorithm is designed to incrementally update the hyperparameters of the approximate posterior distribution and the CDHMM parameters simultaneously. By further introducing a simple forgetting mechanism to adjust the contribution of previously observed sample utterances, the algorithm is adaptive in nature and capable of performing an online adaptive learning using only the current sample utterance. It can, thus, be used to cope with the time-varying nature of some acoustic and environmental variabilities, including mismatches caused by changing speakers, channels, and transducers. As an example, the QB learning framework is applied to on-line speaker adaptation and its viability is confirmed in a series of comparative experiments using a 26-letter English alphabet vocabulary
Keywords :
Bayes methods; Gaussian processes; adaptive estimation; hidden Markov models; inference mechanisms; learning (artificial intelligence); recursive estimation; speech recognition; 26-letter English alphabet vocabulary; CDHMM parameters; Gaussian mixture state observation densities; QB formulation; acoustic variabilities; approximate posterior distribution; approximate recursive Bayes estimate; continuous density hidden Markov model; environmental variabilities; forgetting mechanism; hyperparameters; on-line adaptive learning; quasi-Bayes learning; recursive Bayesian inference; sample utterances; Acoustic testing; Acoustic transducers; Algorithm design and analysis; Bayesian methods; Hidden Markov models; Inference algorithms; Loudspeakers; Maximum likelihood estimation; Recursive estimation; Speech recognition;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on