DocumentCode
1404081
Title
On-line adaptive learning of the correlated continuous density hidden Markov models for speech recognition
Author
Huo, Qiang ; Lee, Chin-Hui
Author_Institution
ATR Interpreting Telephony Res. Labs., Kyoto, Japan
Volume
6
Issue
4
fYear
1998
fDate
7/1/1998 12:00:00 AM
Firstpage
386
Lastpage
397
Abstract
We extend our previously proposed quasi-Bayes adaptive learning framework to cope with the correlated continuous density hidden Markov models (HMMs) with Gaussian mixture state observation densities in which all mean vectors are assumed to be correlated and have a joint prior distribution. A successive approximation algorithm is proposed to implement the correlated mean vectors´ updating. As an example, by applying the method to an on-line speaker adaptation application, the algorithm is experimentally shown to be asymptotically convergent as well as being able to enhance the efficiency and the effectiveness of the Bayes learning by taking into account the correlation information between different model parameters. The technique can be used to cope with the time-varying nature of some acoustic and environmental variabilities, including mismatches caused by changing speakers, channels, transducers, environments, and so on
Keywords
Bayes methods; Gaussian processes; adaptive systems; correlation methods; hidden Markov models; learning (artificial intelligence); speech recognition; Gaussian mixture state observation densities; HMM; acoustic variabilities; channels; correlated continuous density hidden Markov models; correlated mean vectors updating; efficiency; joint prior distribution; mismatches; on-line adaptive learning; on-line speaker adaptation; quasi-Bayes adaptive learning framework; speakers; speech recognition; successive approximation algorithm; time-varying variabilities; transducers; Acoustic testing; Acoustic transducers; Approximation algorithms; Automatic speech recognition; Bayesian methods; Degradation; Hidden Markov models; Loudspeakers; Recursive estimation; Speech recognition;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/89.701369
Filename
701369
Link To Document