DocumentCode :
3527609
Title :
Stereo-based stochastic mapping with discriminative training for noise robust speech recognition
Author :
Cui, Xiaodong ; Afify, Mohamed ; Gao, Yuqing
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
3933
Lastpage :
3936
Abstract :
This paper presents an enhanced stochastic mapping technique in the discriminative feature (fMPE) space that exploits stereo data for noise robust LVCSR. Both MMSE and MAP estimates of the mapping are given and the performance of the two is investigated. Due to the iterative nature of the MAP estimate, we show that combining MMSE and MAP estimates is possible and yields superior performance than each individual estimate. A multi-style discriminative training with minimum phone error (MPE) criterion is further applied to the compensated features and obtains significant performance improvement on real-world noisy test sets.
Keywords :
learning (artificial intelligence); least mean squares methods; maximum likelihood estimation; speech recognition; stochastic processes; MAP estimation; MMSE; discriminative feature space; iterative method; minimum phone error criterion; multi style discriminative training; noise robust speech recognition; stereo-based stochastic mapping; Acoustic noise; Cepstral analysis; Gaussian noise; Linear discriminant analysis; Mel frequency cepstral coefficient; Noise robustness; Speech recognition; Stochastic resonance; Working environment noise; Yield estimation; Stereo feature; automatic speech recognition; discriminative training; noise robustness; stochastic mapping;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960488
Filename :
4960488
Link To Document :
بازگشت