DocumentCode :
1330985
Title :
Speaker verification using mixture decomposition discrimination
Author :
Sukkar, Rafid A. ; Gandhi, Malan B. ; Setlur, Anand R.
Author_Institution :
Bell Labs., Lucent Technol., Naperville, IL, USA
Volume :
8
Issue :
3
fYear :
2000
fDate :
5/1/2000 12:00:00 AM
Firstpage :
292
Lastpage :
299
Abstract :
A new approach for speaker verification is presented. Mixture decomposition discrimination (MDD) is based on the idea that, when modeling speech using speaker independent continuous density hidden Markov models (HMM), different speakers speaking the same word would cause different HMM mixture components to dominate. When the mixture information is considered, one can construct a “mixture profile” of a speaker speaking a given word or phrase. This mixture profile is incorporated into a discriminative training procedure to discriminate between a true speaker and all other speakers (or imposters). The effectiveness of MDD is seen when it is incorporated into a hybrid verification system that also includes speaker dependent HMM modeling with cohort normalization. Experimental results show that the hybrid system reduces the average equal error rate (EER) by 46% when compared with the EER of the speaker-dependent HMM verifier. It is also shown that the computational and model storage requirements needed to incorporate MDD into the hybrid system are relatively small
Keywords :
error statistics; hidden Markov models; speaker recognition; HMM mixture components; average equal error rate; cohort normalization; computational requirements; continuous density hidden Markov models; discriminative training; experimental results; hybrid verification system; imposters; mixture decomposition discrimination; mixture information; mixture profile; speaker dependent HMM; speaker independent continuous density HMM; speaker verification; speech modeling; storage requirements; true speaker; Computational complexity; Computational modeling; Error analysis; Hidden Markov models; Multilayer perceptrons; Speech recognition; Testing; Vectors; Vocabulary;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.841211
Filename :
841211
Link To Document :
بازگشت