Title :
A unified compensation approach for speech recognition in severely adverse environment
Author :
Tian, Bin ; Sun, Mingui ; Sclabassi, Robert J. ; Yi, Kechu
Author_Institution :
State Key Lab. of Integrated Services Networks, Xidian Univ., Xi´´an
Abstract :
The performance of speech recognition systems is greatly degraded in adverse environments, such as in heavy noise, and in articulation variation adaptively made by the speaker. This variation gives rise to the Lombard effect characterized with changes in loudness, speed, emphasis, and clarity of speech. This paper presents a unified approach for speech recognition in adverse environments based on hidden Markov model (HMM) to compensate the additive noise and the Lombard effect. Unlike the existing methods which alleviate additive noise in recognition phase, we design a spectral addition algorithm in the training phase to compensate for the additive noise. Taking both the variance of Mel-scaled frequency cepstrum coefficients (MFCC) and the duration of different HMM states in different acoustic units into consideration, we compensate the Lombard effect by HMM state labeling. Experiments show great robustness in severe adverse environment without sacrificing the performance under normal environment. Since the compensation for noise and the Lombard effects is made in the training phase, it also greatly reduces the computational complexity in the recognition phased
Keywords :
acoustic noise; cepstral analysis; hidden Markov models; learning (artificial intelligence); speech; speech recognition; HMM state labeling; Lombard effect; Mel-scaled frequency cepstrum coefficients; additive noise; adverse environment; articulation variation; hidden Markov model; spectral addition algorithm; speech recognition system; unified compensation; Additive noise; Algorithm design and analysis; Cepstrum; Degradation; Hidden Markov models; Labeling; Mel frequency cepstral coefficient; Noise robustness; Speech recognition; Working environment noise;
Conference_Titel :
Uncertainty Modeling and Analysis, 2003. ISUMA 2003. Fourth International Symposium on
Conference_Location :
College Park, MD
Print_ISBN :
0-7695-1997-0
DOI :
10.1109/ISUMA.2003.1236171