DocumentCode :
744154
Title :
Cochannel Speaker Identification in Anechoic and Reverberant Conditions
Author :
Xiaojia Zhao ; Yuxuan Wang ; DeLiang Wang
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
Volume :
23
Issue :
11
fYear :
2015
Firstpage :
1727
Lastpage :
1736
Abstract :
Speaker identification (SID) in cochannel speech, where two speakers are talking simultaneously over a single recording channel, is a challenging problem. Previous studies address this problem in the anechoic environment under the Gaussian mixture model (GMM) framework. On the other hand, cochannel SID in reverberant conditions has not been addressed. This paper studies cochannel SID in both anechoic and reverberant conditions. We first investigate GMM-based approaches and propose a combined system that integrates two cochannel SID methods. Second, we explore deep neural networks (DNNs) for cochannel SID and propose a DNN-based recognition system. Evaluation results demonstrate that our proposed systems significantly improve SID performance over recent approaches in both anechoic and reverberant conditions and various target-to-interferer ratios.
Keywords :
Gaussian processes; anechoic chambers (acoustic); mixture models; neural nets; reverberation chambers; speaker recognition; DNN; GMM framework; Gaussian mixture model; SID; anechoic conditions; anechoic environment; cochannel SID; cochannel SID methods; cochannel speaker identification; cochannel speech; deep neural networks; reverberant conditions; single recording channel; Gaussian mixture model; Hidden Markov models; IEEE transactions; Speech; Speech processing; Speech recognition; Cochannel speaker identification; Gaussian mixture model (GMM); deep neural network (DNN); reverberation; target-to-interferer ratio;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
2329-9290
Type :
jour
DOI :
10.1109/TASLP.2015.2447284
Filename :
7128346
Link To Document :
بازگشت