Title :
A study on bias-based speech signal conditioning techniques for improving the robustness of automatic speech recognition
Author :
Chowdhury, Md Foezur Rahman ; Selouani, Sid-Ahmed ; Shaughnessy, Douglas O.
Author_Institution :
INRS - EMT, Univ. du Quebec, Quebec City, QC
Abstract :
Automatic speech recognition (ASR) performs poorly when the training conditions greatly mismatch the testing conditions. Additive background noises and channel distortion are responsible mostly for these mismatches. These mismatches introduce highly non-linear terms in the acoustic model of speech in both log-spectral and the cepstral domains. Current ASR is based on simple linear approximation of these non-linear functions in the cepstral domain in order to avoid model complexities. This linear modeling approach transforms the channel distortion into an additive bias term in the cepstral domain under the assumption of high SNR, which is barely true in practical situations. Several algorithms have been developed to estimate this bias term and make compensations either in feature space or in the model domain to improve the robustness of ASR. In this paper, we explore these bias estimate techniques for both stationary and non-stationary acoustic environments to find their applicability for self-adaptable ASR.
Keywords :
distortion; nonlinear functions; speech recognition; additive background noises; automatic speech recognition; bias-based speech signal conditioning techniques; cepstral domains; channel distortion; linear approximation; log-spectral domain; nonlinear functions; Acoustic distortion; Acoustic testing; Automatic speech recognition; Automatic testing; Background noise; Cepstral analysis; Noise robustness; Nonlinear acoustics; Nonlinear distortion; Performance evaluation; Automatic speech recognition; additive noise; channel bias; joint bias removal; robust ASR; self-adaptable ASR;
Conference_Titel :
Electrical and Computer Engineering, 2009. CCECE '09. Canadian Conference on
Conference_Location :
St. John´s, NL
Print_ISBN :
978-1-4244-3509-8
Electronic_ISBN :
0840-7789
DOI :
10.1109/CCECE.2009.5090212