DocumentCode
290128
Title
Adaptation techniques for ambience and microphone compensation in the IBM Tangora speech recognition system
Author
Das, Subrata ; Nádas, Arthur ; Nahamoo, David ; Picheny, Michael
Author_Institution
Dept. of Comput. Sci., IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume
i
fYear
1994
fDate
19-22 Apr 1994
Abstract
Conventional speech recognition systems such as the IBM Tangora tend to be adversely influenced by external factors such as background noise and microphone characteristics. This paper discusses some adaptation techniques to counteract such influences. We also consider ways to reduce computation so that the methods may be implemented for real time Tangora operation. The adaptation strategy utilized two sets of VQ codebooks derived from the training data, one representing the ambience and the other characterizing the speech domain. A speech versus ambience decision was made by examining several factors, such as a comparison of the overall energy in a frame with a percentile point of a running energy histogram. We include results to demonstrate the effectiveness of our approach
Keywords
adaptive signal processing; microphones; speech coding; speech recognition; vector quantisation; IBM Tangora; VQ codebooks; adaptation techniques; ambience; background noise; frame energy; microphone characteristics; microphone compensation; running energy histogram; speech domain; speech recognition system; training data; Background noise; Databases; Histograms; Microphones; Noise level; Noise robustness; Speech enhancement; Speech recognition; Testing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location
Adelaide, SA
ISSN
1520-6149
Print_ISBN
0-7803-1775-0
Type
conf
DOI
10.1109/ICASSP.1994.389365
Filename
389365
Link To Document