Title :
Deep neural networks for cochannel speaker identification
Author :
Xiaojia Zhao ; Yuxuan Wang ; DeLiang Wang
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
Abstract :
Speaker identification (SID) in cochannel speech, where two speakers are talking simultaneously over a single recording channel, is a challenging problem. Previous studies address this problem in the anechoic environment under the Gaussian mixture model (GMM) framework. On the other hand, cochannel SID in reverberant conditions has not been addressed. This paper studies cochannel SID in both anechoic and reverberant conditions. We explore deep neural networks (DNNs) for cochannel SID and propose a DNN-based recognition system. Evaluation results demonstrate the proposed DNN-based system outperforms the two state-of-the-art cochannel SID systems in both anechoic and reverberant conditions and various target-to-interferer ratios.
Keywords :
neural nets; source separation; speech recognition; anechoic conditions; cochannel speaker identification; cochannel speech; deep neural networks; reverberant conditions; Accuracy; NIST; Robustness; Speech; Speech recognition; Training; Training data; Cochannel speaker identification; Gaussian mixture model; deep neural network; reverberation; target-to-interferer ratio;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178887