DocumentCode :
3165113
Title :
Stereo-based stochastic mapping with context using probabilistic PCA for noise robust automatic speech recognition
Author :
Cui, Xiaodong ; Afify, Mohamed ; Zhou, Bowen
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4705
Lastpage :
4708
Abstract :
In this paper we investigate stereo-based stochastic mapping (SSM) with context for the noise robustness of automatic speech recognition, especially under unseen conditions. Probabilistic PCA (PPCA) is used in the SSM framework to reduce the high dimensionality of the noisy speech features with context and derive an eigen representation in the noisy feature space for the prediction of clean features. To reduce the computational cost in training, an approximation by single-pass re-training is considered for the estimation of joint GMM. We also show that the SSM estimate under the minimum mean square error (MMSE) in a space where low dimensional representation of clean speech and uncorrelated additive noise can be assumed is related to the subspace speech enhancement. Experiments on large vocabulary continuous speech recognition tasks observe gains from the proposed approach under the conditions with seen, unseen and real noise.
Keywords :
eigenvalues and eigenfunctions; mean square error methods; principal component analysis; probability; speech enhancement; speech recognition; SSM; automatic speech recognition; computational cost; eigenrepresentation; minimum mean square error; noise robustness; noisy feature space; noisy speech features; probabilistic principal component analysis; single pass retraining; stereo based stochastic mapping; subspace speech enhancement; uncorrelated additive noise; vocabulary continuous speech recognition; Abstracts; Speech enhancement; LVCSR; noise robustness; probabilistic PCA; stereo-based stochastic mapping; subspace speech enhancement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288969
Filename :
6288969
Link To Document :
بازگشت