Title :
Sparse Representation for Speaker Identification
Author :
Naseem, Imran ; Togneri, Roberto ; Bennamoun, Mohammed
Author_Institution :
Univ. of Western Australia, Perth, WA, Australia
Abstract :
We address the closed-set problem of speaker identification by presenting a novel sparse representation classification algorithm. We propose to develop an over complete dictionary using the GMM mean super vector kernel for all the training utterances. A given test utterance corresponds to only a small fraction of the whole training database. We therefore propose to represent a given test utterance as a linear combination of all the training utterances, thereby generating a naturally sparse representation. Using this sparsity, the unknown vector of coefficients is computed via l1-minimization which is also the sparsest solution. Ideally, the vector of coefficients so obtained has nonzero entries representing the class index of the given test utterance. Experiments have been conducted on the standard TIMIT database and a comparison with the state-of-art speaker identification algorithms yields a favorable performance index for the proposed algorithm.
Keywords :
Gaussian processes; minimisation; performance index; signal classification; signal representation; speaker recognition; GMM mean super vector kernel; closed set problem; minimization; performance index; sparse representation classification algorithm; speaker identification; test utterance correspond; Adaptation model; Algorithm design and analysis; Databases; Hidden Markov models; Signal processing algorithms; Speaker recognition; Training;
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-7542-1
DOI :
10.1109/ICPR.2010.1083