Regional Information Center for Science and Technology (RICEST) - Speech separation of a target speaker based on deep neural networks

DocumentCode :

231543

Title :

Speech separation of a target speaker based on deep neural networks

Author :

Jun Du ; Yanhui Tu ; Yong Xu ; Lirong Dai ; Chin-Hui Lee

Author_Institution :

Univ. of Sci. & Technol. of China, Hefei, China

fYear :

2014

fDate :

19-23 Oct. 2014

Firstpage :

473

Lastpage :

477

Abstract :

This paper proposes a novel data-driven approach to single-channel speech separation based on deep neural networks (DNNs). A DNN is adopted to directly model the highly nonlinear relationship between the speech features of a target speaker and those of the mixed signal. Both supervised and semi-supervised scenarios are investigated. In the supervised mode, the identities of both the target speaker and the interfering speaker are provided, whereas in the semi-supervised mode only the target speaker is given. We propose training the DNN on mixtures of the target speaker with multiple interfering speakers, which is shown to predict an unseen interferer well in the separation stage. Experimental results demonstrate that the proposed framework achieves better separation results than a GMM-based approach in the supervised mode. More significantly, in the semi-supervised mode, which is believed to be the preferred mode in real-world operation, the DNN-based approach even outperforms the GMM-based approach operating in the supervised mode.
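
The multi-interferer training idea in the abstract can be illustrated with a minimal sketch of how (mixture, target) training pairs might be assembled. This is not the authors' pipeline: the abstract does not specify the features, the mixing levels, or the data, so the waveform-level mixing, the SNR values, the synthetic tones standing in for speech, and the names `mix_at_snr`, `build_training_pairs`, and `tone` are all assumptions made for illustration.

```python
import math
import random


def rms(signal):
    """Root-mean-square level of a list of samples."""
    return math.sqrt(sum(s * s for s in signal) / len(signal))


def mix_at_snr(target, interferer, snr_db):
    """Scale the interferer so the target-to-interferer ratio is snr_db, then add.

    Returns (mixture, target), both truncated to the shorter input length.
    """
    n = min(len(target), len(interferer))
    target, interferer = target[:n], interferer[:n]
    gain = rms(target) / (rms(interferer) * 10 ** (snr_db / 20))
    mixture = [t + gain * i for t, i in zip(target, interferer)]
    return mixture, target


def build_training_pairs(target_utts, interferer_pool, snrs_db, seed=0):
    """Mix every target utterance with every interfering speaker in the pool.

    Hypothetical helper: the SNR for each pair is drawn at random from
    snrs_db, which is an assumption, not a detail given in the abstract.
    """
    rng = random.Random(seed)
    pairs = []
    for target in target_utts:
        for speaker_id, interferer in interferer_pool.items():
            snr_db = rng.choice(snrs_db)
            mixture, reference = mix_at_snr(target, interferer, snr_db)
            pairs.append({
                "interferer": speaker_id,
                "snr_db": snr_db,
                "input": mixture,    # what the network would see
                "label": reference,  # what the network would be trained to predict
            })
    return pairs


def tone(freq_hz, n=800, rate=8000):
    """Synthetic sine tone used here as a stand-in for a speech signal."""
    return [math.sin(2 * math.pi * freq_hz * k / rate) for k in range(n)]


# Two stand-in target utterances and a pool of three stand-in interferers.
pairs = build_training_pairs(
    [tone(220), tone(260)],
    {"spk_a": tone(330), "spk_b": tone(440), "spk_c": tone(550)},
    snrs_db=[-6, 0, 6],
)
```

In a real system, each `input` and `label` would presumably be converted to frame-level speech features before being used to train a regression network; that step is omitted here because the abstract does not describe it.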

Keywords :

Gaussian processes; feature extraction; mixture models; neural nets; speaker recognition; speech processing; DNN; GMM-based approach; data-driven approach; deep neural networks; interfering speaker; mixed signals; semi-supervised mode; speech features; speech separation; target speaker; Hidden Markov models; Neural networks; Predictive models; Signal to noise ratio; Speech; Training; single-channel speech separation; supervised mode

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Signal Processing (ICSP), 2014 12th International Conference on

Conference_Location :

Hangzhou

ISSN :

2164-5221

Print_ISBN :

978-1-4799-2188-1

Type :

conf

DOI :

10.1109/ICOSP.2014.7015050

Filename :

7015050

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=231543