Title :
Discriminative approach to dynamic variance adaptation for noisy speech recognition
Author :
Delcroix, Marc ; Watanabe, Shinji ; Nakatani, Tomohiro ; Nakamura, Atsushi
Author_Institution :
NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan
Date :
May 30 2011-June 1 2011
Abstract :
The performance of automatic speech recognition degrades severely in the presence of noise or reverberation. One conventional approach to handling such acoustic distortions is to apply a speech enhancement technique prior to recognition. However, most speech enhancement techniques introduce artifacts that create a mismatch between the enhanced speech features and the acoustic model used for recognition, thereby limiting the improvement in recognition performance. Recently, there has been increased interest in methods that compensate for this mismatch by accounting for the feature variance during decoding. In this paper, we propose to estimate the feature variance using an adaptation technique based on a discriminative criterion. In an experiment using the Aurora2 database, the proposed method achieved a significant reduction in digit error rate compared with a spectral subtraction pre-processor, and using a discriminative criterion for adaptation provided further improvement over maximum likelihood estimation.
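The idea of accounting for the feature variance during decoding can be illustrated with a minimal sketch. It assumes diagonal-covariance Gaussians and the common variance-compensation form in which a per-dimension feature variance is added to the acoustic model variance. The function names and the way the feature variance is supplied are hypothetical and are not taken from the paper, which estimates that variance by discriminative adaptation.

```python
import math


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def log_gauss_compensated(x, mean, model_var, feat_var):
    """Same density, with the feature variance added to the model variance.

    Assumption: feat_var is a per-dimension estimate of how unreliable the
    enhanced feature is; how it is estimated is outside this sketch.
    """
    total_var = [mv + fv for mv, fv in zip(model_var, feat_var)]
    return log_gauss_diag(x, mean, total_var)
```

With a zero feature variance the compensated score equals the ordinary one. With a larger feature variance, a feature that lies far from the model mean is penalized less, which is how unreliable enhanced features can be down-weighted during decoding.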
Keywords :
maximum likelihood estimation; speech coding; speech enhancement; speech recognition; acoustic distortion; decoding; digit error rate reduction; discriminative approach; dynamic variance adaptation; feature variance; noisy speech recognition; spectral subtraction preprocessor; speech enhancement technique; Acoustics; Hidden Markov models; Noise; Noise measurement; Speech; Speech enhancement; Speech recognition; MMI; Model Adaptation; Noise reduction; Robust ASR; Variance Compensation
Conference_Title :
Hands-free Speech Communication and Microphone Arrays (HSCMA), 2011 Joint Workshop on
Conference_Location :
Edinburgh
Print_ISBN :
978-1-4577-0997-5
DOI :
10.1109/HSCMA.2011.5942414