DocumentCode :
2327624
Title :
Frame level likelihood normalization for text-independent speaker identification using Gaussian mixture models
Author :
Markov, Konstantin ; Nakagawa, Seiichi
Author_Institution :
Dept. of Inf. & Comput. Sci., Toyohashi Univ. of Technol., Japan
Volume :
3
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
1764
Abstract :
Proposes a new speaker identification system, where the likelihood normalization technique, which is widely used for speaker verification, is introduced. In the new system, which is based on Gaussian mixture models, every frame of the test utterance is input to all the reference models in parallel. In this procedure, for each frame, likelihoods from all the models are available, and hence they can be normalized at every frame. A special kind of likelihood normalization, called the `weighting models rank´, is also proposed. Experiments were performed using two databases-TIMIT and NTT. Evaluation results clearly show that the frame-level likelihood normalization technique is superior to the standard accumulated likelihood approach
Keywords :
Gaussian distribution; speaker recognition; Gaussian mixture models; NTT database; TIMIT database; frame-level likelihood normalization; parallel input; reference models; test utterance frames; text-independent speaker identification; weighting models rank; Databases; Hidden Markov models; Parameter estimation; System testing; Tin;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607970
Filename :
607970
Link To Document :
بازگشت