Title :
Stereo-based stochastic mapping with context using probabilistic PCA for noise robust automatic speech recognition
Author :
Cui, Xiaodong ; Afify, Mohamed ; Zhou, Bowen
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
In this paper we investigate stereo-based stochastic mapping (SSM) with context for the noise robustness of automatic speech recognition, especially under unseen conditions. Probabilistic PCA (PPCA) is used in the SSM framework to reduce the high dimensionality of the noisy speech features with context and derive an eigen representation in the noisy feature space for the prediction of clean features. To reduce the computational cost in training, an approximation by single-pass re-training is considered for the estimation of joint GMM. We also show that the SSM estimate under the minimum mean square error (MMSE) in a space where low dimensional representation of clean speech and uncorrelated additive noise can be assumed is related to the subspace speech enhancement. Experiments on large vocabulary continuous speech recognition tasks observe gains from the proposed approach under the conditions with seen, unseen and real noise.
Keywords :
eigenvalues and eigenfunctions; mean square error methods; principal component analysis; probability; speech enhancement; speech recognition; SSM; automatic speech recognition; computational cost; eigenrepresentation; minimum mean square error; noise robustness; noisy feature space; noisy speech features; probabilistic principal component analysis; single pass retraining; stereo based stochastic mapping; subspace speech enhancement; uncorrelated additive noise; vocabulary continuous speech recognition; Abstracts; Speech enhancement; LVCSR; noise robustness; probabilistic PCA; stereo-based stochastic mapping; subspace speech enhancement;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288969