DocumentCode :
3433948
Title :
GMM based speaker identification using training-time-dependent number of mixtures
Author :
Tadj, Chakib ; Dumouchel, Pierre ; Ouellet, Pierre
Author_Institution :
Ecole de Technol. Superieure-Electr. Eng., Montreal, Que., Canada
Volume :
2
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
761
Abstract :
In this paper, we present the study of the performance of our standard Gaussian mixture model (GMM) speaker identification system in “a limited amount of training data” context. We explore the use of different mixture components for different speakers/models. Different approaches are presented: (a) A nonlinear transformation of speech duration vs. number of mixtures is proposed in order to set correctly the appropriate number of model mixtures for each speaker according to the available training data. (b) From exhaustive experiments, the appropriate linear transformation is deduced. The resulting transformation offers several advantages: (a) each speaker is well modelized, (b) the performance is improved by more than 6% on the SPIDRE corpus and finally (c) the number of mixtures is reduced and thus leads to a faster system response
Keywords :
Gaussian distribution; speaker recognition; GMM speaker identification system; Gaussian mixture model; linear transformation; nonlinear transformation; speech duration; training-time-dependent number of mixtures; Additive noise; Degradation; Educational institutions; Microphones; Noise level; Nonlinear filters; Speech enhancement; Testing; Training data; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.675376
Filename :
675376
Link To Document :
بازگشت