Title :
Acoustic model adaptation using first order prediction for reverberant speech
Author :
Takiguchi, Tetsuya ; Nishimura, Masafumi
Author_Institution :
Tokyo Res. Lab., IBM Japan Ltd., Japan
Abstract :
The paper describes a hands-free speech recognition technique based on acoustic model adaptation to reverberant speech. In hands-free speech recognition, the recognition accuracy is degraded by reverberation, since each segment of speech is affected by the reflection energy of the preceding segment. To compensate for the reflection signal, we introduce a frame-by-frame adaptation method, adding the reflection signal to the means of the acoustic model. The reflection signal is approximated by a first-order linear prediction from the preceding frame, and the linear prediction coefficient is estimated by a maximum likelihood method by using the EM algorithm, which maximizes the likelihood of the adaptation data. Its effectiveness is confirmed by word recognition experiments on reverberant speech.
Keywords :
acoustic signal processing; acoustic wave reflection; adaptive signal processing; maximum likelihood estimation; optimisation; prediction theory; reverberation; speech recognition; EM algorithm; acoustic model adaptation; first order prediction; first-order linear prediction; frame-by-frame adaptation; hands-free speech recognition; maximum likelihood estimation; reflection energy; reverberant speech; reverberation; word recognition; Acoustic distortion; Acoustic reflection; Adaptation model; Cepstral analysis; Degradation; Hidden Markov models; Microphone arrays; Predictive models; Reverberation; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326124