Title :
Identifying in-set and out-of-set speakers using neighborhood information
Author :
Angkititrakul, Pongtep ; Hansen, John H L
Author_Institution :
Robust Speech Process. Group, Center for Spoken Language Res., Colorado Univ., Boulder, CO, USA
Abstract :
We study the problem of identifying in-set and out-of-set speakers. The goal is to identify whether an unknown input speaker belongs to either a group of in-set speakers or an unseen out-of-set group. A state-of-the-art GMM classifier, with universal background model (UBM) and standard likelihood ratio test, is used as our baseline system. We propose an alternative hypothesis testing method that employs neighborhood information with respect to each in-set speaker model in the model space based on the Kullback-Leibier divergence. The Bayes factor is used in the verification stage (accept/reject hypothesis). We evaluate the proposed procedure on a clean CORPUS 1 set, and a noisy CORPUS 2 set which contains session-to-session variability. Experiments show an improvement in equal error rate for the system even when in-set speaker models are acoustically close in the model space, and as the size of the in-set speaker group increases.
Keywords :
Bayes methods; Gaussian processes; error statistics; speaker recognition; Bayes factor; GMM classifier; Kullback-Leibier divergence; equal error rate; hypothesis testing method; in-set speaker identification; neighborhood information; out-of-set speaker identification; standard likelihood ratio test; universal background model; Bayesian methods; Error analysis; Forensics; Loudspeakers; Natural languages; Robustness; Speaker recognition; Speech processing; System testing; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326005