DocumentCode :
1749666
Title :
Towards non-stationary model-based noise adaptation for large vocabulary speech recognition
Author :
Kristjansson, T. ; Frey, B. ; Deng, L. ; Acero, A.
Author_Institution :
Dept. of Comput. Sci., Waterloo Univ., Ont., Canada
Volume :
1
fYear :
2001
fDate :
2001
Firstpage :
337
Abstract :
Recognition rates of speech recognition systems are known to degrade substantially when there is a mismatch between training and deployment environments. One approach to tackling this problem is to transform the acoustic models based on the channel distortion and noise characteristics of the new environment. Currently, most model adaptation strategies assume that the noise characteristics are stationary. We present results for using multiple noise distributions for the Whisper large vocabulary speech recognition system. The vector Taylor series method for adaptation of the distributions is used, and either a weighted average of the noise states or the locally best noise states is used. Our results indicate that for certain types of noise, significant gains in recognition accuracy can be achieved
Keywords :
Gaussian distribution; cepstral analysis; hidden Markov models; noise; series (mathematics); speech recognition; vectors; Whisper; acoustic models; channel distortion; large vocabulary speech recognition; model adaptation; noise characteristics; nonstationary model-based noise adaptation; recognition accuracy; recognition rates; vector Taylor series; Acoustic noise; Adaptation model; Background noise; Frequency; Speech enhancement; Speech recognition; Taylor series; Transfer functions; Vocabulary; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
ISSN :
1520-6149
Print_ISBN :
0-7803-7041-4
Type :
conf
DOI :
10.1109/ICASSP.2001.940836
Filename :
940836
Link To Document :
بازگشت