مرکز منطقه ای اطلاع رساني علوم و فناوري - Secondary Classification for GMM Based Speaker Recognition

DocumentCode :

454533

Title :

Secondary Classification for GMM Based Speaker Recognition

Author :

Pelecanos, Jason ; Povey, Dan ; Ramaswamy, Ganesh

Author_Institution :

Conversational Biometrics Group, IBM Thomas J. Watson Res. Center, Yorktown Heights, NY

Volume :

fYear :

2006

fDate :

14-19 May 2006

Abstract :

This paper discusses the use of a secondary classifier to reweight the frame-based scores of a speaker recognition system according to which region in feature space they belong. The score mapping function is constructed to perform a likelihood ratio (LR) correction of the original LR scores. This approach has the ability to limit the effect of rogue model components and regions of feature space that may not be robust to different audio environments, handset types or speakers. Prior information available from tests on a development data set can be used to determine a log-likelihood-ratio mapping function that more appropriately weights each speech frame. The computational overhead for this approach in online mode is close to negligible for significant performance gains shown for the NIST 2004 Speaker Recognition Evaluation data

Keywords :

Gaussian processes; speaker recognition; GMM; Gaussian mixture model; frame-based scores; log-likelihood-ratio mapping function; secondary classification; speaker recognition; speech frame; Bayesian methods; Biometrics; NIST; Performance gain; Robustness; Speaker recognition; Speech synthesis; Statistics; Telephone sets; Testing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on

Conference_Location :

Toulouse

ISSN :

1520-6149

Print_ISBN :

1-4244-0469-X

Type :

conf

DOI :

10.1109/ICASSP.2006.1659969

Filename :

1659969

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=454533