On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive Bayes estimate

Author

Huo, Qiang ; Lee, Chin-Hui

Author_Institution

ATR Interpreting Telephony Res. Labs., Kyoto, Japan

Volume

5

Issue

2

fYear

1997

fDate

3/1/1997 12:00:00 AM

Firstpage

161

Lastpage

172

Abstract

We present a framework of quasi-Bayes (QB) learning of the parameters of the continuous density hidden Markov model (CDHMM) with Gaussian mixture state observation densities. The QB formulation is based on the theory of recursive Bayesian inference. The QB algorithm is designed to incrementally update the hyperparameters of the approximate posterior distribution and the CDHMM parameters simultaneously. By further introducing a simple forgetting mechanism to adjust the contribution of previously observed sample utterances, the algorithm is adaptive in nature and capable of performing an online adaptive learning using only the current sample utterance. It can, thus, be used to cope with the time-varying nature of some acoustic and environmental variabilities, including mismatches caused by changing speakers, channels, and transducers. As an example, the QB learning framework is applied to on-line speaker adaptation and its viability is confirmed in a series of comparative experiments using a 26-letter English alphabet vocabulary

Keywords

Bayes methods; Gaussian processes; adaptive estimation; hidden Markov models; inference mechanisms; learning (artificial intelligence); recursive estimation; speech recognition; 26-letter English alphabet vocabulary; CDHMM parameters; Gaussian mixture state observation densities; QB formulation; acoustic variabilities; approximate posterior distribution; approximate recursive Bayes estimate; continuous density hidden Markov model; environmental variabilities; forgetting mechanism; hyperparameters; on-line adaptive learning; quasi-Bayes learning; recursive Bayesian inference; sample utterances; Acoustic testing; Acoustic transducers; Algorithm design and analysis; Bayesian methods; Hidden Markov models; Inference algorithms; Loudspeakers; Maximum likelihood estimation; Recursive estimation; Speech recognition;

fLanguage

English

Journal_Title

Speech and Audio Processing, IEEE Transactions on

Publisher

ieee

ISSN

1063-6676

Type

jour

DOI

10.1109/89.554778

Filename

554778