DocumentCode :
2516040
Title :
Sparse Representation for Speaker Identification
Author :
Naseem, Imran ; Togneri, Roberto ; Bennamoun, Mohammed
Author_Institution :
Univ. of Western Australia, Perth, WA, Australia
fYear :
2010
fDate :
23-26 Aug. 2010
Firstpage :
4460
Lastpage :
4463
Abstract :
We address the closed-set problem of speaker identification by presenting a novel sparse representation classification algorithm. We propose to develop an over complete dictionary using the GMM mean super vector kernel for all the training utterances. A given test utterance corresponds to only a small fraction of the whole training database. We therefore propose to represent a given test utterance as a linear combination of all the training utterances, thereby generating a naturally sparse representation. Using this sparsity, the unknown vector of coefficients is computed via l1-minimization which is also the sparsest solution. Ideally, the vector of coefficients so obtained has nonzero entries representing the class index of the given test utterance. Experiments have been conducted on the standard TIMIT database and a comparison with the state-of-art speaker identification algorithms yields a favorable performance index for the proposed algorithm.
Keywords :
Gaussian processes; minimisation; performance index; signal classification; signal representation; speaker recognition; GMM mean super vector kernel; closed set problem; minimization; performance index; sparse representation classification algorithm; speaker identification; test utterance correspond; Adaptation model; Algorithm design and analysis; Databases; Hidden Markov models; Signal processing algorithms; Speaker recognition; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
ISSN :
1051-4651
Print_ISBN :
978-1-4244-7542-1
Type :
conf
DOI :
10.1109/ICPR.2010.1083
Filename :
5597865
Link To Document :
بازگشت