Regional Information Center for Science and Technology - Regularized Auto-Associative Neural Networks for Speaker Verification

DocumentCode :

1304453

Title :

Regularized Auto-Associative Neural Networks for Speaker Verification

Author :

Garimella, Sri ; Mallidi, Sri Harish ; Hermansky, Hynek

Author_Institution :

ECE Dept., Johns Hopkins Univ., Baltimore, MD, USA

Volume :

Issue :

fYear :

2012

Firstpage :

841

Lastpage :

844

Abstract :

An Auto-Associative Neural Network (AANN) is a fully connected feed-forward neural network trained to reconstruct its input at its output through a hidden compression layer. AANNs are used to model speakers in speaker verification, where a speaker-specific AANN model is obtained by adapting (or retraining) the Universal Background Model (UBM) AANN, an AANN trained on multiple held-out speakers, using the corresponding speaker's data. When the amount of speaker data is limited, this adaptation procedure leads to overfitting. Additionally, the resulting speaker-specific parameters become noisy due to outliers in the data. Thus, we propose to regularize the parameters of an AANN during speaker adaptation. A closed-form expression for updating the parameters is derived. Further, these speaker-specific AANN parameters are used directly as features in a linear discriminant analysis (LDA)/probabilistic linear discriminant analysis (PLDA) based speaker verification system. The proposed speaker verification system outperforms the previously proposed weighted least squares (WLS) based AANN speaker verification system on the NIST-08 speaker recognition evaluation (SRE). Moreover, the proposed speaker verification system obviates the need for an intermediate dimensionality reduction (or i-vector extraction) step.
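
The abstract mentions a closed-form, regularized parameter update but this record does not give the expression. As a rough illustration only, the sketch below assumes a generic ridge-style penalty that pulls a single linear output layer toward its UBM weights; the function name `adapt_output_layer`, the symbols `H`, `Y`, `W_ubm`, `lam`, and the synthetic data are all hypothetical and are not taken from the paper.

```python
import numpy as np


def adapt_output_layer(H, Y, W_ubm, lam):
    """Illustrative closed-form update (an assumption, not the paper's formula).

    Minimizes ||Y - H @ W||_F^2 + lam * ||W - W_ubm||_F^2 over W, where
    H holds hidden-layer activations (frames x hidden units), Y holds the
    reconstruction targets (frames x output units), and W_ubm holds the
    UBM weights that the speaker-specific weights are pulled toward.
    """
    d = H.shape[1]
    A = H.T @ H + lam * np.eye(d)   # regularized normal-equation matrix
    B = H.T @ Y + lam * W_ubm       # data term plus pull toward the UBM
    return np.linalg.solve(A, B)


# Synthetic stand-in for a small amount of speaker data.
rng = np.random.default_rng(0)
H = rng.standard_normal((20, 5))
W_ubm = rng.standard_normal((5, 3))
Y = H @ W_ubm + 0.1 * rng.standard_normal((20, 3))

W_small = adapt_output_layer(H, Y, W_ubm, lam=1e-8)  # nearly unregularized
W_large = adapt_output_layer(H, Y, W_ubm, lam=1e8)   # stays near the UBM
W_ls = np.linalg.lstsq(H, Y, rcond=None)[0]          # plain least squares
```

Under this assumed penalty, a very small `lam` approaches the ordinary least-squares fit, while a very large `lam` keeps the adapted weights close to the UBM, which is the kind of trade-off that limits overfitting when speaker data is scarce.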

Keywords :

feedforward neural nets; probability; speaker recognition; AANN; NIST-08 speaker recognition evaluation; closed-form expression; feed-forward neural network; intermediate dimensionality reduction; linear discriminant analysis; probabilistic discriminant analysis; regularized auto-associative neural network; speaker adaptation; speaker verification system; speaker-specific parameter; universal background model; Adaptation models; Data models; Feature extraction; Mel frequency cepstral coefficient; Neural networks; Training; Vectors; Adaptation; auto-associative neural network; regularization; speaker verification;

fLanguage :

English

Journal_Title :

Signal Processing Letters, IEEE

Publisher :

IEEE

ISSN :

1070-9908

Type :

jour

DOI :

10.1109/LSP.2012.2221706

Filename :

6319350

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1304453