Title :
Adaptation techniques for ambience and microphone compensation in the IBM Tangora speech recognition system
Author :
Das, Subrata ; Nádas, Arthur ; Nahamoo, David ; Picheny, Michael
Author_Institution :
Dept. of Comput. Sci., IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
Conventional speech recognition systems such as the IBM Tangora tend to be adversely influenced by external factors such as background noise and microphone characteristics. This paper discusses some adaptation techniques to counteract such influences. We also consider ways to reduce computation so that the methods may be implemented for real time Tangora operation. The adaptation strategy utilized two sets of VQ codebooks derived from the training data, one representing the ambience and the other characterizing the speech domain. A speech versus ambience decision was made by examining several factors, such as a comparison of the overall energy in a frame with a percentile point of a running energy histogram. We include results to demonstrate the effectiveness of our approach
Keywords :
adaptive signal processing; microphones; speech coding; speech recognition; vector quantisation; IBM Tangora; VQ codebooks; adaptation techniques; ambience; background noise; frame energy; microphone characteristics; microphone compensation; running energy histogram; speech domain; speech recognition system; training data; Background noise; Databases; Histograms; Microphones; Noise level; Noise robustness; Speech enhancement; Speech recognition; Testing; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-1775-0
DOI :
10.1109/ICASSP.1994.389365