Speaker verification using mixture decomposition discrimination

Author

Sukkar, Rafid A. ; Gandhi, Malan B. ; Setlur, Anand R.

Author_Institution

Bell Labs., Lucent Technol., Naperville, IL, USA

Volume

8

Issue

3

fYear

2000

fDate

5/1/2000 12:00:00 AM

Firstpage

292

Lastpage

299

Abstract

A new approach for speaker verification is presented. Mixture decomposition discrimination (MDD) is based on the idea that, when modeling speech using speaker independent continuous density hidden Markov models (HMM), different speakers speaking the same word would cause different HMM mixture components to dominate. When the mixture information is considered, one can construct a “mixture profile” of a speaker speaking a given word or phrase. This mixture profile is incorporated into a discriminative training procedure to discriminate between a true speaker and all other speakers (or imposters). The effectiveness of MDD is seen when it is incorporated into a hybrid verification system that also includes speaker dependent HMM modeling with cohort normalization. Experimental results show that the hybrid system reduces the average equal error rate (EER) by 46% when compared with the EER of the speaker-dependent HMM verifier. It is also shown that the computational and model storage requirements needed to incorporate MDD into the hybrid system are relatively small

Keywords

error statistics; hidden Markov models; speaker recognition; HMM mixture components; average equal error rate; cohort normalization; computational requirements; continuous density hidden Markov models; discriminative training; experimental results; hybrid verification system; imposters; mixture decomposition discrimination; mixture information; mixture profile; speaker dependent HMM; speaker independent continuous density HMM; speaker verification; speech modeling; storage requirements; true speaker; Computational complexity; Computational modeling; Error analysis; Hidden Markov models; Multilayer perceptrons; Speech recognition; Testing; Vectors; Vocabulary;

fLanguage

English

Journal_Title

Speech and Audio Processing, IEEE Transactions on

Publisher

ieee

ISSN

1063-6676

Type

jour

DOI

10.1109/89.841211

Filename

841211