Title :
Speech recognition using noise-adaptive prototypes
Author :
Nádas, Arthur ; Nahamoo, David ; Picheny, Michael A.
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
A probabilistic mixture model is described for a frame (the short-term spectrum) of each to be used in speech recognition. Each component of the mixture is regarded as a prototype for the labeling phase of a hidden Markov model based speech recognition system. Since the ambient noise during recognition can differ from the ambient noise present in the training data, the model is designed for convenient updating in changing noise. Based on the observation that the energy in a frequency band is at any fixed time dominated either by signal energy or by noise energy, the authors model the energy as the larger of the separate energies of signal and noise in the band. Statistical algorithms are given for training this as a hidden variables model. The hidden variables are the prototype identities and the separate signal and noise components. A series of speech recognition experiments that successfully utilize this model is also discussed
Keywords :
Markov processes; acoustic noise; speech recognition; ambient noise; frequency band; hidden Markov model; hidden variables model; noise energy; noise-adaptive prototypes; probabilistic mixture model; short-term spectrum; signal energy; speech frame; speech recognition; statistical algorithms; training data; Acoustic applications; Acoustic noise; Hidden Markov models; Phase noise; Prototypes; Signal processing; Speech enhancement; Speech processing; Speech recognition; Working environment noise;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on
Conference_Location :
New York, NY
DOI :
10.1109/ICASSP.1988.196633