Title :
Stochastic features for noise robust speech recognition
Author :
Iwahashi, N. ; Pao, H. ; Honda, H. ; Minamino, K. ; Omote, M.
Author_Institution :
Sony Corp., Tokyo, Japan
Abstract :
This paper describes a novel technique for noise robust speech recognition, which can incorporate the characteristics of noise distribution directly in features. The feature itself of each analysis frame has a stochastic form, which can represent the probability density function of the estimated speech component in the noisy speech. Using the sequence of the probability density functions of the estimated speech components and hidden Markov modelling of clean speech, the observation probability of the noisy speech is calculated. In the whole process of the technique, the explicit information on the SNR is not used. The technique is evaluated by large vocabulary isolated word recognition under car noise environment, and is found to have clearly outperformed nonlinear spectral subtraction (with between 13% and 44% reduction in recognition errors)
Keywords :
automobiles; cepstral analysis; error statistics; feature extraction; hidden Markov models; noise; probability; speech recognition; stochastic processes; analysis frame; car noise environment; cepstral domain; clean speech; estimated speech component; estimated speech components; hidden Markov modelling; large vocabulary isolated word recognition; noise distribution; noise robust speech recognition; noisy speech; nonlinear spectral subtraction; observation probability; probability density function; recognition error reduction; stochastic feature extraction; Hidden Markov models; Noise reduction; Noise robustness; Probability density function; Signal to noise ratio; Speech analysis; Speech recognition; Stochastic resonance; Vocabulary; Working environment noise;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.675344