Title :
On the temporal decorrelation of feature parameters for noise-robust speech recognition
Author :
Jung, Ho-Young ; Lee, Soo-Young
Author_Institution :
Dept. of Electr. Eng., Korea Adv. Inst. of Sci. & Technol., Seoul, South Korea
fDate :
7/1/2000 12:00:00 AM
Abstract :
We propose a new frame decorrelation method for robust speech recognition in noisy environments. In most cases, signal perturbation is caused by channel distortion and additive background noise, and can be modeled as a slowly varying term in either the log spectral or the linear-spectral domains. Thus, it is effective to deemphasize slowly varying stationary components in the spectral feature domain of speech signals, which can be considered as a temporal decorrelation process. The proposed method presents a well structured high-pass filter using the decorrelation principle, and provides some significant insights into existing high-pass approaches, such as relative spectral (RASTA) processing. The performance of the proposed method was evaluated by speaker-independent isolated-word recognition experiments using the hidden Markov model (HMM). Noisy speech was simulated by adding noise sources taken from the Noisex-92 database. Experimental results showed that the proposed method was effective for speech recognition with significant noise and yielded better performance than other high-pass methods. In addition, we compared the dynamic property of the proposed filter with that of delta features. The feature obtained by the proposed method may offer most of the delta feature property
Keywords :
decorrelation; feature extraction; filtering theory; hidden Markov models; high-pass filters; noise; parameter estimation; spectral analysis; speech recognition; HMM; Noisex-92 database; RASTA processing; additive background noise; channel distortion; delta features; dynamic property; experimental results; feature parameters; frame decorrelation method; hidden Markov model; high-pass filter; linear-spectral domain; log spectral domain; noise sources; noise-robust speech recognition; noisy environments; noisy speech simulation; performance evaluation; relative spectral processing; signal perturbation; slowly varying stationary components; speaker-independent isolated-word recognition; spectral feature domain; speech signals; temporal decorrelation; Additive noise; Background noise; Decorrelation; Distortion; Filters; Hidden Markov models; Noise robustness; Speech processing; Speech recognition; Working environment noise;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on