DocumentCode :
2839266
Title :
Text-independent speaker identification using GMM-UBM and frame level likelihood normalization
Author :
Zheng, Rong ; Zhang, Shuwu ; Xu, Bo
Author_Institution :
Inst. of Autom., Chinese Acad. of Sci., Beijing, China
fYear :
2004
fDate :
15-18 Dec. 2004
Firstpage :
289
Lastpage :
292
Abstract :
In this paper, we describe a Gaussian mixture model-universal background model (GMM-UBM) speaker identification system. In this GMM-UBM system, we derive the hypothesized speaker model by adapting the parameters of UBM using the speaker´s training speech and a form of Bayesian adaptation. The UBM technique is incorporated into the GMM speaker identification system to reduce the time requirement for recognition significantly. The paper also presents a new frame level likelihood score normalization for adjusting different scores of speaker models to get more robust scores in the final decision. Experiments on the 2000 NIST speaker recognition evaluation corpus show that GMM-UBM and frame level likelihood score normalization yield better performance. Compared to the baseline system, around 31.2% relative error reduction is obtained from the combination of both techniques.
Keywords :
Bayes methods; Gaussian distribution; error statistics; speaker recognition; 2000 NIST speaker recognition evaluation corpus; Bayesian adaptation; GMM-UBM; Gaussian mixture model-universal background model; error reduction; frame level likelihood normalization; score normalization; text-independent speaker identification; training speech; Adaptation model; Bayesian methods; Laboratories; Microphones; Pattern recognition; Robustness; Speaker recognition; Speech; Technological innovation; Telephony;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing, 2004 International Symposium on
Print_ISBN :
0-7803-8678-7
Type :
conf
DOI :
10.1109/CHINSL.2004.1409643
Filename :
1409643
Link To Document :
بازگشت