Acoustic model adaptation using first order prediction for reverberant speech

Author

Takiguchi, Tetsuya ; Nishimura, Masafumi

Author_Institution

Tokyo Res. Lab., IBM Japan Ltd., Japan

Volume

1

fYear

2004

fDate

17-21 May 2004

Abstract

The paper describes a hands-free speech recognition technique based on acoustic model adaptation to reverberant speech. In hands-free speech recognition, the recognition accuracy is degraded by reverberation, since each segment of speech is affected by the reflection energy of the preceding segment. To compensate for the reflection signal, we introduce a frame-by-frame adaptation method, adding the reflection signal to the means of the acoustic model. The reflection signal is approximated by a first-order linear prediction from the preceding frame, and the linear prediction coefficient is estimated by a maximum likelihood method by using the EM algorithm, which maximizes the likelihood of the adaptation data. Its effectiveness is confirmed by word recognition experiments on reverberant speech.

Keywords

acoustic signal processing; acoustic wave reflection; adaptive signal processing; maximum likelihood estimation; optimisation; prediction theory; reverberation; speech recognition; EM algorithm; acoustic model adaptation; first order prediction; first-order linear prediction; frame-by-frame adaptation; hands-free speech recognition; maximum likelihood estimation; reflection energy; reverberant speech; reverberation; word recognition; Acoustic distortion; Acoustic reflection; Adaptation model; Cepstral analysis; Degradation; Hidden Markov models; Microphone arrays; Predictive models; Reverberation; Speech recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on

ISSN

1520-6149

Print_ISBN

0-7803-8484-9

Type

conf

DOI

10.1109/ICASSP.2004.1326124

Filename

1326124