Title :
Vector quantization techniques for GMM based speaker verification
Author :
Singh, Gurmeet ; Panda, Ashish ; Bhattacharyya, Souvik ; Srikanthan, Thambipillai
Author_Institution :
Indian Inst. of Technol., Kanpur, India
Abstract :
This paper explores the novel application of two vector quantization algorithms, namely Linde, Buzo, Gray (1980) and K-means algorithm for efficient speaker verification. Automatic speaker verification (ASV) is a memory and compute intensive process, giving rise to area and latency concerns in the way of its implementation for real-time efficient embedded systems. The training schemes for computing the speaker models, such as the expectation maximization are highly iterative and contribute significantly to the overall complexity in the implementation of the system. We demonstrate the use of the LBG and the K-means algorithm to realize compute efficient training method. Models trained with the LBG algorithm achieves as much as 99.88% of EM accuracy, whilst K-means achieves as much as 99.91% of EM accuracy. Moreover, the EM computational complexity is almost twice that of LBG or K-means. Thus, using LBG and K-means algorithms for training Gaussian mixture speaker models for text-independent speaker verification, we show that, that they deliver comparable performance as the EM algorithm at significantly reduced computational complexity. Thus making them an ideal choice for low-cost applications.
Keywords :
Gaussian processes; computational complexity; embedded systems; optimisation; speaker recognition; speech coding; vector quantisation; EM accuracy; EM computational complexity; GMM based speaker verification; Gaussian mixture model; Gaussian mixture speaker models training; K-means algorithm; LBG algorithm; Linde Buzo Gray algorithm; automatic speaker verification; efficient training method; expectation maximization; latency; real-time efficient embedded systems; speaker models; text-independent speaker verification; training schemes; vector quantization; Biometrics; Computational complexity; Convergence; Costs; Embedded computing; Embedded system; High performance computing; Iterative algorithms; Testing; Vector quantization;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1202295