Title :
A Joint Factor Analysis Approach to Progressive Model Adaptation in Text-Independent Speaker Verification
Author :
Yin, Shou-Chun ; Rose, Richard ; Kenny, Patrick
Author_Institution :
McGill Univ., Montreal
Abstract :
This paper addresses the issue of speaker variability and session variability in text-independent Gaussian mixture model (GMM)-based speaker verification. A speaker model adaptation procedure is proposed which is based on a joint factor analysis approach to speaker verification. It is shown in this paper that this approach facilitates the implementation of a progressive unsupervised adaptation strategy which is able to produce an improved model of speaker identity while minimizing the influence of channel variability. The paper also deals with the interaction between this model adaptation approach and score normalization strategies which act to reduce the variation in likelihood ratio scores. This issue is particularly important in establishing decision thresholds in practical speaker verification systems since the variability of likelihood ratio scores can increase as a result of progressive model adaptation. These adaptation methods have been evaluated under the adaptation paradigm defined under the NIST 2005 Speaker Recognition Evaluation Plan, which is based on conversation sides derived from telephone speech utterances. It was found that when target speaker models were trained from a single conversation, an equal error rate (EER) of 4.5% was obtained under the NIST unsupervised speaker adaptation scenario.
Keywords :
Gaussian processes; speaker recognition; Gaussian mixture model; NIST 2005 Speaker Recognition Evaluation Plan; equal error rate; joint factor analysis approach; progressive model adaptation; progressive unsupervised adaptation strategy; score normalization strategies; session variability; speaker variability; speaker verification; text-independent speaker verification; Acoustical engineering; Adaptation model; Communication channels; Councils; Error analysis; Loudspeakers; NIST; Speaker recognition; Speech analysis; Telephony; Factor analysis; Gaussian mixture model (GMM); speaker adaptation; speaker verification;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2007.902410