DocumentCode
417273
Title
Acoustic model adaptation using first order prediction for reverberant speech
Author
Takiguchi, Tetsuya ; Nishimura, Masafumi
Author_Institution
Tokyo Res. Lab., IBM Japan Ltd., Japan
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
This paper describes a hands-free speech recognition technique based on adapting the acoustic model to reverberant speech. In hands-free speech recognition, reverberation degrades recognition accuracy because each speech segment is affected by the reflection energy of the preceding segment. To compensate for the reflection signal, we introduce a frame-by-frame adaptation method that adds the reflection signal to the means of the acoustic model. The reflection signal is approximated by first-order linear prediction from the preceding frame, and the linear prediction coefficient is estimated with the EM algorithm so as to maximize the likelihood of the adaptation data. The effectiveness of the method is confirmed by word recognition experiments on reverberant speech.
Keywords
acoustic signal processing; acoustic wave reflection; adaptive signal processing; maximum likelihood estimation; optimisation; prediction theory; reverberation; speech recognition; EM algorithm; acoustic model adaptation; first order prediction; first-order linear prediction; frame-by-frame adaptation; hands-free speech recognition; maximum likelihood estimation; reflection energy; reverberant speech; reverberation; word recognition; Acoustic distortion; Acoustic reflection; Adaptation model; Cepstral analysis; Degradation; Hidden Markov models; Microphone arrays; Predictive models; Reverberation; Speech recognition
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326124
Filename
1326124