DocumentCode :
394326
Title :
A novel spectral subtraction scheme for robust speech recognition: spectral subtraction using spectral harmonics of speech
Author :
Beh, Jounghoon ; Ko, Hanseok
Author_Institution :
Departments of Electron. & Comput. Eng., Korea Univ., Seoul, South Korea
Volume :
1
fYear :
2003
fDate :
6-10 April 2003
Abstract :
This paper addresses a novel noise-compensation scheme to solve the mismatch problem between training condition and testing condition for an automatic speech recognition (ASR) system, specifically in in-car environments. The conventional spectral subtraction schemes rely on the signal to noise ratio (SNR) such that attenuation is imposed on that part of the spectrum that appears to have low SNR, and accentuation is made on that part of high SNR. However, these schemes are based on the postulation that the power spectrum of noise is in general at the lower level in magnitude than that of speech. Therefore, while such postulation is adequate for high SNR environment, it is grossly inadequate for low SNR scenarios such as the in-car environment. This paper proposes an efficient spectral subtraction scheme focused to specifically low SNR noisy environments by distinguishing the speech-dominant segment from the noise-dominant segment in the speech spectrum. Representative experiments confirm the superior performance of the proposed method over conventional methods. The experiments are conducted using car noise-corrupted utterances of the Aurora2 corpus.
Keywords :
acoustic noise; spectral analysis; speech recognition; ASR system; automatic speech recognition; in-car environments; low SNR environments; mismatch problem; noise-compensation scheme; noise-dominant segment; performance; robust speech recognition; spectral speech harmonics; spectral subtraction scheme; speech-dominant segment; Attenuation; Automatic speech recognition; Automatic testing; Noise level; Noise robustness; Signal to noise ratio; Speech enhancement; Speech recognition; System testing; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1198864
Filename :
1198864
Link To Document :
بازگشت