DocumentCode
735072
Title
A unified speaker-dependent speech separation and enhancement system based on deep neural networks
Author
Tian Gao ; Jun Du ; Li Xu ; Cong Liu ; Li-Rong Dai ; Chin-Hui Lee
Author_Institution
Univ. of Sci. & Technol. of China, Hefei, China
fYear
2015
fDate
12-15 July 2015
Firstpage
687
Lastpage
691
Abstract
Speech enhancement and speech separation are important frontends of many speech processing systems. In real tasks, the background noises are often mixed with some human voice interferences. In this paper, we explore a framework to unify speech enhancement and speech separation for a speaker-dependent scenario based on deep neural networks (DNNs). Using a supervised method, DNN is adopted to directly model a nonlinear mapping function between noisy and clean speech signals. The signals of speaker interferers are considered as one type of universal noise signals in our framework. In order to be able to handle a wide range of additive noise in the real-world situations, a large training set that encompasses many possible combinations of speech and noise types, is designed. Experimental results demonstrate that the proposed framework can get the comparable performances to those single speech enhancement or separation systems. Furthermore, the resulting DNN model, trained with artificial synthesized data, is also effective in dealing with noisy speech data recorded in real-world conditions.
Keywords
neural nets; speaker recognition; speech enhancement; DNN; additive noise; artificial synthesized data; background noises; clean speech signals; deep neural networks; human voice interferences; noise types; noisy speech data; noisy speech signals; nonlinear mapping function; speaker interferers; speaker-dependent scenario; speech processing systems; speech types; supervised method; unified speaker-dependent speech separation and enhancement system; universal noise signals; Neural networks; Noise; Noise measurement; Speech; Speech enhancement; Training; deep neural networks; speaker-dependent; speech enhancement; speech separation; supervised method;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on
Conference_Location
Chengdu
Type
conf
DOI
10.1109/ChinaSIP.2015.7230492
Filename
7230492
Link To Document