DocumentCode :
417273
Title :
Acoustic model adaptation using first order prediction for reverberant speech
Author :
Takiguchi, Tetsuya ; Nishimura, Masafumi
Author_Institution :
Tokyo Res. Lab., IBM Japan Ltd., Japan
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
The paper describes a hands-free speech recognition technique based on acoustic model adaptation to reverberant speech. In hands-free speech recognition, the recognition accuracy is degraded by reverberation, since each segment of speech is affected by the reflection energy of the preceding segment. To compensate for the reflection signal, we introduce a frame-by-frame adaptation method, adding the reflection signal to the means of the acoustic model. The reflection signal is approximated by a first-order linear prediction from the preceding frame, and the linear prediction coefficient is estimated by a maximum likelihood method by using the EM algorithm, which maximizes the likelihood of the adaptation data. Its effectiveness is confirmed by word recognition experiments on reverberant speech.
Keywords :
acoustic signal processing; acoustic wave reflection; adaptive signal processing; maximum likelihood estimation; optimisation; prediction theory; reverberation; speech recognition; EM algorithm; acoustic model adaptation; first order prediction; first-order linear prediction; frame-by-frame adaptation; hands-free speech recognition; maximum likelihood estimation; reflection energy; reverberant speech; reverberation; word recognition; Acoustic distortion; Acoustic reflection; Adaptation model; Cepstral analysis; Degradation; Hidden Markov models; Microphone arrays; Predictive models; Reverberation; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326124
Filename :
1326124
Link To Document :
بازگشت