Title :
Subspace construction and selection for speaker recognition
Author :
Long, Yanhua ; Guo, Wu ; Ma, Bin ; Chng, Eng Siong ; Zhu, Donglai ; Dai, Lirong ; Li, Haizhou
Author_Institution :
iFly Speech Lab., Univ. of Sci. & Technol. of China (USTC), Hefei, China
Abstract :
In this paper, we propose a subspace construction and selection strategy (SUBS) for speaker recognition with limited training and testing speech data. Based on the individual Gaussian distributions of Gaussian mixture model (GMM), each speaker´s characteristic subspace is constructed by training an SVM using the corresponding Gaussian mean vectors from the GMMs of both enrollment and imposter speakers. A subspace selection based on the structure risk criterion is used to select those subspaces with lower structure risks. The selected subspaces are then combined and used to evaluate the test utterances. We evaluate this subspace strategy on the 10sec-10sec test condition in 2008 NIST speaker recognition evaluations, achieving a relative 12.16% equal error rate reduction over the GMM supervector baseline system.
Keywords :
Gaussian distribution; speaker recognition; support vector machines; GMM supervector baseline system; Gaussian mean vectors; Gaussian mixture model; NIST speaker recognition evaluations; SVM training; error rate reduction; individual Gaussian distributions; speaker characteristic subspace; speech data testing; structure risk criterion; subspace construction and selection strategy; support vector machine; Gaussian distribution; Maximum likelihood linear regression; Modular construction; NIST; Speaker recognition; Speech; Strontium; Support vector machines; Testing; Training data;
Conference_Titel :
Information, Communications and Signal Processing, 2009. ICICS 2009. 7th International Conference on
Conference_Location :
Macau
Print_ISBN :
978-1-4244-4656-8
Electronic_ISBN :
978-1-4244-4657-5
DOI :
10.1109/ICICS.2009.5397658