DocumentCode :
857180
Title :
MAP speaker adaptation of state duration distributions for speech recognition
Author :
Yoma, Néstor Becerra ; Sánchez, Jorge Silva
Author_Institution :
Dept. of Electr. Eng., Chile Univ., Santiago, Chile
Volume :
10
Issue :
7
fYear :
2002
fDate :
10/1/2002 12:00:00 AM
Firstpage :
443
Lastpage :
450
Abstract :
This paper presents a framework for maximum a posteriori (MAP) speaker adaptation of state duration distributions in hidden Markov models (HMM). Four key issues of MAP estimation, namely analysis and modeling of state duration distributions, the choice of prior distribution, the specification of the parameters of the prior density and the evaluation of the MAP estimates, are tackled. Moreover, a comparison with an adaptation procedure based on maximum likelihood (ML) estimation is presented, and the problem of truncation of the state duration distribution is addressed from the statistical point of view. The results shown in this paper suggest that the speaker adaptation of temporal restrictions substantially improves the accuracy of speaker-independent (SI) HMM with clean and noisy speech. The method requires a low computational load and a small number of adapting utterances, and can be useful to follow the dynamics of the speaking rate in speech recognition.
Keywords :
hidden Markov models; maximum likelihood estimation; noise; speech recognition; statistical analysis; HMM; MAP estimation; MAP speaker adaptation; MLE; clean speech; hidden Markov models; low computational load; maximum a posteriori speaker adaptation; maximum likelihood estimation; noisy speech; speaking rate dynamics; speech recognition; state duration distribution truncation; temporal restrictions; Additive noise; Error analysis; Hidden Markov models; Maximum likelihood estimation; Noise reduction; Solid modeling; Speech recognition; State estimation; Testing; Viterbi algorithm;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2002.803441
Filename :
1045276
Link To Document :
بازگشت