مرکز منطقه ای اطلاع رساني علوم و فناوري - Text-independent speaker identification using GMM-UBM and frame level likelihood normalization

DocumentCode :

2839266

Title :

Text-independent speaker identification using GMM-UBM and frame level likelihood normalization

Author :

Zheng, Rong ; Zhang, Shuwu ; Xu, Bo

Author_Institution :

Inst. of Autom., Chinese Acad. of Sci., Beijing, China

fYear :

2004

fDate :

15-18 Dec. 2004

Firstpage :

289

Lastpage :

292

Abstract :

In this paper, we describe a Gaussian mixture model-universal background model (GMM-UBM) speaker identification system. In this GMM-UBM system, we derive the hypothesized speaker model by adapting the parameters of UBM using the speaker´s training speech and a form of Bayesian adaptation. The UBM technique is incorporated into the GMM speaker identification system to reduce the time requirement for recognition significantly. The paper also presents a new frame level likelihood score normalization for adjusting different scores of speaker models to get more robust scores in the final decision. Experiments on the 2000 NIST speaker recognition evaluation corpus show that GMM-UBM and frame level likelihood score normalization yield better performance. Compared to the baseline system, around 31.2% relative error reduction is obtained from the combination of both techniques.

Keywords :

Bayes methods; Gaussian distribution; error statistics; speaker recognition; 2000 NIST speaker recognition evaluation corpus; Bayesian adaptation; GMM-UBM; Gaussian mixture model-universal background model; error reduction; frame level likelihood normalization; score normalization; text-independent speaker identification; training speech; Adaptation model; Bayesian methods; Laboratories; Microphones; Pattern recognition; Robustness; Speaker recognition; Speech; Technological innovation; Telephony;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Chinese Spoken Language Processing, 2004 International Symposium on

Print_ISBN :

0-7803-8678-7

Type :

conf

DOI :

10.1109/CHINSL.2004.1409643

Filename :

1409643

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2839266