Stochastic features for noise robust speech recognition

Author

Iwahashi, N. ; Pao, H. ; Honda, H. ; Minamino, K. ; Omote, M.

Author_Institution

Sony Corp., Tokyo, Japan

Volume

2

fYear

1998

fDate

12-15 May 1998

Firstpage

633

Abstract

This paper describes a novel technique for noise robust speech recognition, which can incorporate the characteristics of noise distribution directly in features. The feature itself of each analysis frame has a stochastic form, which can represent the probability density function of the estimated speech component in the noisy speech. Using the sequence of the probability density functions of the estimated speech components and hidden Markov modelling of clean speech, the observation probability of the noisy speech is calculated. In the whole process of the technique, the explicit information on the SNR is not used. The technique is evaluated by large vocabulary isolated word recognition under car noise environment, and is found to have clearly outperformed nonlinear spectral subtraction (with between 13% and 44% reduction in recognition errors)

Keywords

automobiles; cepstral analysis; error statistics; feature extraction; hidden Markov models; noise; probability; speech recognition; stochastic processes; analysis frame; car noise environment; cepstral domain; clean speech; estimated speech component; estimated speech components; hidden Markov modelling; large vocabulary isolated word recognition; noise distribution; noise robust speech recognition; noisy speech; nonlinear spectral subtraction; observation probability; probability density function; recognition error reduction; stochastic feature extraction; Hidden Markov models; Noise reduction; Noise robustness; Probability density function; Signal to noise ratio; Speech analysis; Speech recognition; Stochastic resonance; Vocabulary; Working environment noise;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on

Conference_Location

Seattle, WA

ISSN

1520-6149

Print_ISBN

0-7803-4428-6

Type

conf

DOI

10.1109/ICASSP.1998.675344

Filename

675344