DocumentCode :
3329998
Title :
Nonlinear noise compensation in feature domain for speech recognition with numerical methods
Author :
Jiang, Hui ; Wang, Qi
Author_Institution :
Dept. of Comput. Sci., York Univ., Toronto, Ont., Canada
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
In this paper, we propose to compensate noise in the log-spectral domain for robust speech recognition based on a nonlinear environmental model. In our approach, starting from the original nonlinear speech distortion model in the feature domain, we derive the minimum mean square error (MMSE) estimation of clean speech signal given a noisy observation, which turns out to be an integral of a complex nonlinear function. In this work, we propose to use a numerical method to solve the above nonlinear integral. It requires higher computational complexity than the normal linear approximation methods but it is usually affordable since calculation is performed entirely in the pre-processing feature domain without involving any change in speech decoders. Experimental results show that the proposed nonlinear method outperforms the conventional vector Taylor series (VTS) method in terms of ASR performance when dealing with artificial white Gaussian noise as well as true hands-free noisy speech, especially in low SNR levels.
Keywords :
AWGN; feature extraction; integral equations; least mean squares methods; nonlinear distortion; nonlinear estimation; spectral analysis; speech recognition; ASR performance; MMSE estimation; artificial white Gaussian noise; computational complexity; feature domain; hands-free noisy speech; log-spectral domain; minimum mean square error; nonlinear environmental model; nonlinear integral; nonlinear noise compensation; nonlinear speech distortion model; pre-processing feature domain; robust speech recognition; Computational complexity; Decoding; Estimation error; Gaussian noise; Linear approximation; Mean square error methods; Noise robustness; Nonlinear distortion; Speech recognition; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326153
Filename :
1326153
Link To Document :
بازگشت