مرکز منطقه ای اطلاع رساني علوم و فناوري - Frame level likelihood normalization for text-independent speaker identification using Gaussian mixture models

DocumentCode :

2327624

Title :

Frame level likelihood normalization for text-independent speaker identification using Gaussian mixture models

Author :

Markov, Konstantin ; Nakagawa, Seiichi

Author_Institution :

Dept. of Inf. & Comput. Sci., Toyohashi Univ. of Technol., Japan

Volume :

fYear :

1996

fDate :

3-6 Oct 1996

Firstpage :

1764

Abstract :

Proposes a new speaker identification system, where the likelihood normalization technique, which is widely used for speaker verification, is introduced. In the new system, which is based on Gaussian mixture models, every frame of the test utterance is input to all the reference models in parallel. In this procedure, for each frame, likelihoods from all the models are available, and hence they can be normalized at every frame. A special kind of likelihood normalization, called the `weighting models rank´, is also proposed. Experiments were performed using two databases-TIMIT and NTT. Evaluation results clearly show that the frame-level likelihood normalization technique is superior to the standard accumulated likelihood approach

Keywords :

Gaussian distribution; speaker recognition; Gaussian mixture models; NTT database; TIMIT database; frame-level likelihood normalization; parallel input; reference models; test utterance frames; text-independent speaker identification; weighting models rank; Databases; Hidden Markov models; Parameter estimation; System testing; Tin;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on

Conference_Location :

Philadelphia, PA

Print_ISBN :

0-7803-3555-4

Type :

conf

DOI :

10.1109/ICSLP.1996.607970

Filename :

607970

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2327624