• DocumentCode
    1330985
  • Title

    Speaker verification using mixture decomposition discrimination

  • Author

    Sukkar, Rafid A. ; Gandhi, Malan B. ; Setlur, Anand R.

  • Author_Institution
    Bell Labs., Lucent Technol., Naperville, IL, USA
  • Volume
    8
  • Issue
    3
  • fYear
    2000
  • fDate
    5/1/2000 12:00:00 AM
  • Firstpage
    292
  • Lastpage
    299
  • Abstract
    A new approach for speaker verification is presented. Mixture decomposition discrimination (MDD) is based on the idea that, when modeling speech using speaker independent continuous density hidden Markov models (HMM), different speakers speaking the same word would cause different HMM mixture components to dominate. When the mixture information is considered, one can construct a “mixture profile” of a speaker speaking a given word or phrase. This mixture profile is incorporated into a discriminative training procedure to discriminate between a true speaker and all other speakers (or imposters). The effectiveness of MDD is seen when it is incorporated into a hybrid verification system that also includes speaker dependent HMM modeling with cohort normalization. Experimental results show that the hybrid system reduces the average equal error rate (EER) by 46% when compared with the EER of the speaker-dependent HMM verifier. It is also shown that the computational and model storage requirements needed to incorporate MDD into the hybrid system are relatively small
  • Keywords
    error statistics; hidden Markov models; speaker recognition; HMM mixture components; average equal error rate; cohort normalization; computational requirements; continuous density hidden Markov models; discriminative training; experimental results; hybrid verification system; imposters; mixture decomposition discrimination; mixture information; mixture profile; speaker dependent HMM; speaker independent continuous density HMM; speaker verification; speech modeling; storage requirements; true speaker; Computational complexity; Computational modeling; Error analysis; Hidden Markov models; Multilayer perceptrons; Speech recognition; Testing; Vectors; Vocabulary;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.841211
  • Filename
    841211