Title :
Speaker Recognition using a Kind of Novel Phonotactic Information
Author :
Zhang, Xiang ; Xiao, Xiang ; Wang, Haipeng ; Suo, Hongbin ; Zhao, Qingwei ; Yan, Yonghong
Author_Institution :
ThinkIT Speech Lab., Chinese Acad. of Sci., Beijing, China
Abstract :
In this paper, we present a new modeling approach for speaker recognition, which uses a kind of novel phonotactic information as the feature for S VM modeling. Gaussian mixture models (GMMs) have been proven extremely successful for text- independent speaker recognition. The GMM universal background model (UBM) is a speaker-independent model, each component of which can be considered to be modeling some underlying phonetic sounds. Thus, the UBM can be regarded to characterize a speaker-independent voice. We assume that the utterances from different speakers should get different average posterior probabilities on the same Gaussian component of the UBM, and the supervector composed of the average posterior probabilities on all components of the UBM for each utterance should be discriminative. We use these supervectors as the features for SVM based speaker recognition. Experiment results show that the proposed approach demonstrates comparable performance with the state-of-the-art systems on NIST 2006 SRE corpus. Fusion results are also presented.
Keywords :
Gaussian processes; speaker recognition; support vector machines; GMM universal background model; Gaussian mixture models; SVM modeling; phonotactic information; posterior probabilities; speaker recognition; Acoustics; Cepstral analysis; Feature extraction; Histograms; Loudspeakers; NIST; Research and development; Speaker recognition; Speech recognition; Support vector machines;
Conference_Titel :
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2942-4
Electronic_ISBN :
978-1-4244-2943-1
DOI :
10.1109/CHINSL.2008.ECP.94