DocumentCode :
734993
Title :
Local trajectory based speech enhancement for robust speech recognition with deep neural network
Author :
Yongbin You ; Yanmin Qian ; Kai Yu
Author_Institution :
Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., Shanghai, China
fYear :
2015
fDate :
12-15 July 2015
Firstpage :
5
Lastpage :
9
Abstract :
Deep neural network(DNN) has achieved a great success in automatic speech recognition(ASR), and it can be regarded as a joint model combining the nonlinear feature transformation and the log-linear classifier. Recently DNN is adopted as a regression model to enhance the distorted feature in noisy condition and the enhanced feature is utilized to improve the performance of DNN based ASR. Previous work only predicts a single frame (log-spectrum) using the enhanced DNN and the final improvement of ASR is not big. In this paper, local trajectory, represented using multiple frames with dynamic features, is predicted instead to make the feature enhancement more stable. In addition, FBank features and long context window are used to better integrating the enhanced DNN into ASR DNN. Experiments on the Aurora4 corpus showed that, compared to the standard DNN baseline, the proposed approach can achieve 9.6% relative WER reduction and also significantly outperform the previously proposed DNN ASR system using the log-spectrum feature based enhancement.
Keywords :
feature extraction; neural nets; regression analysis; speech enhancement; speech recognition; ASR; DNN; automatic speech recognition; deep neural network; feature enhancement; local trajectory; log-linear classifier; nonlinear feature transformation; regression model; speech enhancement; Acoustics; Context; Hidden Markov models; Speech; Speech recognition; Training; Trajectory; deep neural network; dynamic feature; feature enhancement; noise robustness;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on
Conference_Location :
Chengdu
Type :
conf
DOI :
10.1109/ChinaSIP.2015.7230351
Filename :
7230351
Link To Document :
بازگشت