• DocumentCode
    2697444
  • Title

    Experiments in Speaker Adaptation for Factor Analysis Based Speaker Verification

  • Author

    Yin, Shou-Chun ; Kenny, Patrick ; Rose, Richard

  • Author_Institution
    Centre de Recherche Informatique de Montreal, Que.
  • fYear
    2006
  • fDate
    28-30 June 2006
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper presents methods for supervised and unsupervised speaker adaptation of Gaussian mixture speaker models in text-independent speaker verification. The methods are based on an approach that decomposes speaker and channel variability, so that speaker models can be updated progressively while minimizing the influence of the channel variability associated with the adaptation utterances. This approach relies on a joint factor analysis model of intrinsic speaker variability and session variability, where inter-session variation is assumed to result primarily from the effects of the channel. These adaptation methods have been evaluated under the adaptation paradigm defined in the NIST 2005 speaker recognition evaluation plan, which is based on conversational telephone speech. It was found that when both target speaker model training and speaker verification trials were performed using a five-minute excerpt from a single conversation, an equal error rate (EER) of 4.5% and a minimum detection cost function (DCF) of 0.013 were obtained when performing unsupervised speaker adaptation during evaluation. It will be shown that this performance is comparable to that obtained by state-of-the-art speaker verification systems that rely on a larger set of features and are trained from as many as eight conversations from the target speaker.
  • Keywords
    Gaussian processes; error statistics; speaker recognition; DCF; EER; GMM; Gaussian mixture model; NIST 2005 speaker recognition evaluation; channel variability; conversational telephone speech; equal error rate; factor analysis; minimum detection cost function; speaker adaptation; text-independent speaker verification; Adaptation model; Aging; Cost function; Error analysis; NIST; Performance evaluation; Speaker recognition; Speech analysis; Telephony; Testing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Speaker and Language Recognition Workshop, 2006. IEEE Odyssey 2006: The
  • Conference_Location
    San Juan
  • Print_ISBN
    1-4244-0471-1
  • Electronic_ISBN
    1-4244-0472-X
  • Type

    conf

  • DOI
    10.1109/ODYSSEY.2006.248130
  • Filename
    4013547