DocumentCode :
1690490
Title :
Noise adaptive front-end normalization based on Vector Taylor Series for Deep Neural Networks in robust speech recognition
Author :
Bo Li ; Khe Chai Sim
Author_Institution :
Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore
fYear :
2013
Firstpage :
7408
Lastpage :
7412
Abstract :
Deep Neural Networks (DNNs) have been successfully applied to various speech tasks in recent years. In this paper, we investigate the use of DNNs for noise-robust speech recognition and demonstrate their superior capability for modeling acoustic variations compared with conventional Gaussian Mixture Models (GMMs). We then propose to compensate the normalization front-end of the DNNs using the GMM-based Vector Taylor Series (VTS) model compensation technique, which has been successfully applied in GMM-based ASR systems to handle noisy speech. To fully benefit from both the powerful modeling capability of the DNN and the effective noise compensation of VTS, an adaptive training algorithm is further developed. Preliminary experimental results on the AURORA 2 task demonstrate the effectiveness of our approach. The adaptively trained system outperforms GMM-based VTS adaptive training by a relative 18.8% using MFCC features and 21.9% using FBank features.
Keywords :
Gaussian processes; learning (artificial intelligence); neural nets; speech recognition; DNN; GMM-based ASR systems; Gaussian mixture models; acoustic variations modeling; adaptive training algorithm; deep neural networks; noise adaptive front-end normalization; noisy speech; robust speech recognition; vector Taylor series; Acoustic distortion; Adaptation models; Noise; Speech; Speech recognition; Training; Training data; Deep Neural Networks; Noise Robustness; Vector Taylor Series;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639102
Filename :
6639102