DocumentCode :
730756
Title :
Deep neural networks for cochannel speaker identification
Author :
Xiaojia Zhao ; Yuxuan Wang ; DeLiang Wang
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
4824
Lastpage :
4828
Abstract :
Speaker identification (SID) in cochannel speech, where two speakers are talking simultaneously over a single recording channel, is a challenging problem. Previous studies address this problem in the anechoic environment under the Gaussian mixture model (GMM) framework. On the other hand, cochannel SID in reverberant conditions has not been addressed. This paper studies cochannel SID in both anechoic and reverberant conditions. We explore deep neural networks (DNNs) for cochannel SID and propose a DNN-based recognition system. Evaluation results demonstrate the proposed DNN-based system outperforms the two state-of-the-art cochannel SID systems in both anechoic and reverberant conditions and various target-to-interferer ratios.
Keywords :
neural nets; source separation; speech recognition; anechoic conditions; cochannel speaker identification; cochannel speech; deep neural networks; reverberant conditions; Accuracy; NIST; Robustness; Speech; Speech recognition; Training; Training data; Cochannel speaker identification; Gaussian mixture model; deep neural network; reverberation; target-to-interferer ratio;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178887
Filename :
7178887
Link To Document :
بازگشت