DocumentCode :
3585057
Title :
Improving speaker recognition performance in the domain adaptation challenge using deep neural networks
Author :
Garcia-Romero, Daniel ; Xiaohui Zhang ; McCree, Alan ; Povey, Daniel
Author_Institution :
Human Language Technol. Center of Excellence & Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD, USA
fYear :
2014
Firstpage :
378
Lastpage :
383
Abstract :
Traditional i-vector speaker recognition systems use a Gaussian mixture model (GMM) to collect sufficient statistics (SS). Recently, replacing this GMM with a deep neural network (DNN) has shown promising results. In this paper, we explore the use of DNNs to collect SS for the unsupervised domain adaptation task of the Domain Adaptation Challenge (DAC).We show that collecting SS with a DNN trained on out-of-domain data boosts the speaker recognition performance of an out-of-domain system by more than 25%. Moreover, we integrate the DNN in an unsupervised adaptation framework, that uses agglomerative hierarchical clustering with a stopping criterion based on unsupervised calibration, and show that the initial gains of the out-of-domain system carry over to the final adapted system. Despite the fact that the DNN is trained on the out-of-domain data, the final adapted system produces a relative improvement of more than 30% with respect to the best published results on this task.
Keywords :
neural nets; pattern clustering; speaker recognition; statistical analysis; unsupervised learning; DAC; DNN; GMM; Gaussian mixture model; agglomerative hierarchical clustering; deep neural networks; domain adaptation challenge; i-vector speaker recognition systems; speaker recognition performance; stopping criterion; sufficient statistics collection; unsupervised calibration; unsupervised domain adaptation task; Acoustics; Conferences; Neural networks; Speaker recognition; Speech; Speech processing; Training; Unsupervised adaptation; deep neural networks; i-vectors; speaker recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2014 IEEE
Type :
conf
DOI :
10.1109/SLT.2014.7078604
Filename :
7078604
Link To Document :
بازگشت