Title :
Robust speech recognition with time-varying filtering, interruptions, and noise
Author :
Lippmann, Richard P. ; Carlson, Beth A.
Author_Institution :
Lincoln Lab., MIT, Lexington, MA, USA
Abstract :
Speech recognizers trained with quiet wide-band speech degrade dramatically with high-pass, low-pass and notch filtering, with noise and with interruptions of the speech input. A new and simple approach to compensate for these degradations is presented which uses mel-filter-bank (MFB) magnitudes as input features and missing feature theory to dynamically modify the probability computations performed in hidden Markov model recognizers. When the identity of features missing due to filtering or masking is provided, recognition accuracy on a large talker-independent digit recognition task often rises from below 50% to above 95%. These promising results suggest future work to continuously estimate the SNR within MFB bands for dynamic adaptation of speech recognizers
Keywords :
acoustic noise; hidden Markov models; probability; speech recognition; time-varying filters; continuous SNR estimation; degradation compensation; dynamic adaptation; hidden Markov model; high-pass filtering; input features; low-pass filtering; masking; mel-filter-bank magnitudes; missing feature theory; notch filtering; probability computation dynamic modification; quiet wide-band speech; recognition accuracy; robust speech recognition; signal to noise ratio; speech input interruptions; talker-independent digit recognition task; time-varying filtering; training; Computer vision; Degradation; Detectors; Filtering; Hidden Markov models; Low pass filters; Noise robustness; Speech analysis; Speech enhancement; Speech recognition;
Conference_Titel :
Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-7803-3698-4
DOI :
10.1109/ASRU.1997.659112