DocumentCode :
454533
Title :
Secondary Classification for GMM Based Speaker Recognition
Author :
Pelecanos, Jason ; Povey, Dan ; Ramaswamy, Ganesh
Author_Institution :
Conversational Biometrics Group, IBM Thomas J. Watson Res. Center, Yorktown Heights, NY
Volume :
1
fYear :
2006
fDate :
14-19 May 2006
Abstract :
This paper discusses the use of a secondary classifier to reweight the frame-based scores of a speaker recognition system according to which region in feature space they belong. The score mapping function is constructed to perform a likelihood ratio (LR) correction of the original LR scores. This approach has the ability to limit the effect of rogue model components and regions of feature space that may not be robust to different audio environments, handset types or speakers. Prior information available from tests on a development data set can be used to determine a log-likelihood-ratio mapping function that more appropriately weights each speech frame. The computational overhead for this approach in online mode is close to negligible for significant performance gains shown for the NIST 2004 Speaker Recognition Evaluation data
Keywords :
Gaussian processes; speaker recognition; GMM; Gaussian mixture model; frame-based scores; log-likelihood-ratio mapping function; secondary classification; speaker recognition; speech frame; Bayesian methods; Biometrics; NIST; Performance gain; Robustness; Speaker recognition; Speech synthesis; Statistics; Telephone sets; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1659969
Filename :
1659969
Link To Document :
بازگشت