DocumentCode
2178843
Title
A partial least squares framework for speaker recognition
Author
Srinivasan, Balaji Vasan ; Zotkin, Dmitry N. ; Duraiswami, Ramani
Author_Institution
Dept. of Comput. Sci., Univ. of Maryland, College Park, MD, USA
fYear
2011
fDate
22-27 May 2011
Firstpage
5276
Lastpage
5279
Abstract
Modern approaches to speaker recognition (verification) operate in a space of "supervectors" created via concatenation of the mean vectors of a Gaussian mixture model (GMM) adapted from a universal background model (UBM). In this space, a number of approaches to model inter-class separability and nuisance attribute variability have been proposed. We develop a method for modeling the variability associated with each class (speaker) by using partial-least-squares - a latent variable modeling technique, which isolates the most informative subspace for each speaker. The method is tested on NIST SRE 2008 data and provides promising results. The method is shown to be noise-robust and to be able to efficiently learn the subspace corresponding to a speaker on training data consisting of multiple utterances.
Keywords
Gaussian processes; least squares approximations; speaker recognition; GMM; Gaussian mixture model; NIST SRE; interclass separability; latent variable modeling technique; multiple utterances; nuisance attribute variability; partial least squares; partial-least-squares; speaker recognition; speaker verification; universal background model; Adaptation models; NIST; Speaker recognition; Speech; Support vector machines; Training; Training data; GMM supervectors; Partial least squares; latent vector; speaker recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947548
Filename
5947548
Link To Document