DocumentCode :
2926449
Title :
Environmental robustness in automatic speech recognition
Author :
Acero, Alejandro ; Stern, Richard
Author_Institution :
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
1990
fDate :
3-6 Apr 1990
Firstpage :
849
Abstract :
Initial efforts to make Sphinx, a continuous-speech speaker-independent recognition system, robust to changes in the environment are reported. To deal with differences in noise level and spectral tilt between close-talking and desk-top microphones, two novel methods based on additive corrections in the cepstral domain are proposed. In the first algorithm, the additive correction depends on the instantaneous SNR of the signal. In the second technique, expectation-maximization techniques are used to best match the cepstral vectors of the input utterances to the ensemble of codebook entries representing a standard acoustical ambience. Use of the algorithms dramatically improves recognition accuracy when the system is tested on a microphone other than the one on which it was trained
Keywords :
interference suppression; microphones; speech recognition; Sphinx; additive correction; cepstral domain; close-talking microphones; continuous-speech speaker-independent recognition system; desk-top microphones; expectation-maximization techniques; instantaneous SNR; noise level; recognition accuracy; spectral tilt; Acoustic testing; Additive noise; Automatic speech recognition; Cepstral analysis; Code standards; Microphones; Noise level; Noise robustness; Signal to noise ratio; System testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
Conference_Location :
Albuquerque, NM
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.1990.115971
Filename :
115971
Link To Document :
بازگشت