DocumentCode :
231561
Title :
Deep neural network based speech separation for robust speech recognition
Author :
Yanhui Tu ; Jun Du ; Yong Xu ; Lirong Dai ; Chin-Hui Lee
Author_Institution :
Univ. of Sci. & Technol. of China, Hefei, China
fYear :
2014
fDate :
19-23 Oct. 2014
Firstpage :
532
Lastpage :
536
Abstract :
In this paper, a novel deep neural network (DNN) architecture is proposed to generate the speech features of both the target speaker and interferer for speech separation without using any prior information about the interfering speaker. DNN is adopted here to directly model the highly nonlinear relationship between speech features of the mixed signals and the two competing speakers. Experimental results on a monaural speech separation and recognition challenge task show that the proposed DNN framework enhances the separation performance in terms of different objective measures under the semi-supervised mode where the training data of the target speaker is provided while the unseen interferer in the separation stage is predicted by using multiple interfering speakers mixed with the target speaker in the training stage. Furthermore, as a preprocessing step in the testing stage for robust speech recognition, our speech separation approach can achieve significant improvements of the recognition accuracy over the baseline system with no source separation.
Keywords :
feature extraction; neural nets; speech recognition; DNN architecture; baseline system; deep neural network based speech separation; interfering speaker; monaural speech separation; recognition challenge task; robust speech recognition; semisupervised mode; speech features; target speaker; training data; training stage; Hidden Markov models; Neural networks; Robustness; Speech; Speech processing; Speech recognition; Training; deep neural networks; robust speech recognition; semi-supervised mode; single-channel speech separation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing (ICSP), 2014 12th International Conference on
Conference_Location :
Hangzhou
ISSN :
2164-5221
Print_ISBN :
978-1-4799-2188-1
Type :
conf
DOI :
10.1109/ICOSP.2014.7015061
Filename :
7015061
Link To Document :
بازگشت