• DocumentCode
    1097217
  • Title
    A Joint Factor Analysis Approach to Progressive Model Adaptation in Text-Independent Speaker Verification
  • Author
    Yin, Shou-Chun ; Rose, Richard ; Kenny, Patrick
  • Author_Institution
    McGill Univ., Montreal
  • Volume
    15
  • Issue
    7
  • fYear
    2007
  • Firstpage
    1999
  • Lastpage
    2010
  • Abstract
    This paper addresses the issue of speaker variability and session variability in text-independent Gaussian mixture model (GMM)-based speaker verification. A speaker model adaptation procedure is proposed which is based on a joint factor analysis approach to speaker verification. It is shown in this paper that this approach facilitates the implementation of a progressive unsupervised adaptation strategy which is able to produce an improved model of speaker identity while minimizing the influence of channel variability. The paper also deals with the interaction between this model adaptation approach and score normalization strategies which act to reduce the variation in likelihood ratio scores. This issue is particularly important in establishing decision thresholds in practical speaker verification systems since the variability of likelihood ratio scores can increase as a result of progressive model adaptation. These adaptation methods have been evaluated under the adaptation paradigm defined in the NIST 2005 Speaker Recognition Evaluation Plan, which is based on conversation sides derived from telephone speech utterances. It was found that when target speaker models were trained from a single conversation, an equal error rate (EER) of 4.5% was obtained under the NIST unsupervised speaker adaptation scenario.
  • Keywords
    Gaussian processes; speaker recognition; Gaussian mixture model; NIST 2005 Speaker Recognition Evaluation Plan; equal error rate; joint factor analysis approach; progressive model adaptation; progressive unsupervised adaptation strategy; score normalization strategies; session variability; speaker variability; speaker verification; text-independent speaker verification; Acoustical engineering; Adaptation model; Communication channels; Councils; Error analysis; Loudspeakers; NIST; Speech analysis; Telephony; Factor analysis; Gaussian mixture model (GMM); speaker adaptation
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type
    jour
  • DOI
    10.1109/TASL.2007.902410
  • Filename
    4291618