Title :
Speaker Model Clustering for Efficient Speaker Identification in Large Population Applications
Author :
Apsingekar, Vijendra Raj ; De Leon, Phillip L.
Author_Institution :
Klipsch Sch. of Electr. & Comput. Eng., New Mexico State Univ., Las Cruces, NM
fDate :
5/1/2009 12:00:00 AM
Abstract :
In large population speaker identification (SI) systems, likelihood computations between an unknown speaker´s feature vectors and the registered speaker models can be very time-consuming and impose a bottleneck. For applications requiring fast SI, this is a recognized problem and improvements in efficiency would be beneficial. In this paper, we propose a method whereby GMM-based speaker models are clustered using a simple k-means algorithm. Then, during the test stage, only a small proportion of speaker models in selected clusters are used in the likelihood computations resulting in a significant speed-up with little to no loss in accuracy. In general, as the number of selected clusters is reduced, the identification accuracy decreases; however, this loss can be controlled through proper tradeoff. The proposed method may also be combined with other test stage speed-up techniques resulting in even greater speed-up gains without additional sacrifices in accuracy.
Keywords :
Gaussian processes; maximum likelihood estimation; pattern clustering; speaker recognition; GMM; Gaussian mixture model; large population application; maximum likelihood estimation; speaker identification; speaker model clustering; Cepstral analysis; Clustering algorithms; Covariance matrix; Forensics; Indexing; Signal processing; Speaker recognition; Speech; Support vector machines; System testing; Clustering methods; speaker recognition;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2008.2010882