Title :
Speaker recognition via sparse representations using orthogonal matching pursuit
Author :
Boominathan, Vivek ; Murty, K. Sri Rama
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol. Hyderabad, Hyderabad, India
Abstract :
The objective of this paper is to demonstrate the effectiveness of sparse representation techniques for speaker recognition. In this approach, each feature vector from unknown utterance is expressed as linear weighted sum of a dictionary of feature vectors belonging to many speakers. The weights associated with feature vectors in the dictionary are evaluated using orthogonal matching pursuit algorithm, which is a greedy approximation to l0 optimization. The weights thus obtained exhibit high level of sparsity, and only a few of them will have nonzero values. The feature vectors which belong to the correct speaker carry significant weights. The proposed method gives an equal error rate (EER) of 10.84% on NIST-2003 database, whereas the existing GMM-UBM system gives an EER of 9.67%. By combining evidence from both the systems an EER of 8.15% is achieved, indicating that both the systems carry complimentary information.
Keywords :
approximation theory; feature extraction; greedy algorithms; optimisation; speaker recognition; GMM-UBM system; NIST-2003 database; equal error rate; feature vector; greedy approximation; l0 optimization; linear weighted sum; orthogonal matching pursuit; sparse representations; speaker recognition; Adaptation models; Feature extraction; Matching pursuit algorithms; Speaker recognition; Testing; Training; Vectors; Sparse representation; l0 optimization and Gaussian mixture modeling; orthogonal matching pursuit;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288890