DocumentCode :
2238525
Title :
Time sequence information within a Gaussian mixture model
Author :
Stapert, R.P. ; Mason, J.S.
Author_Institution :
Dept. of Electr. & Electron. Eng., Univ. of Wales Swansea, Swansea, UK
fYear :
2002
fDate :
3-6 Sept. 2002
Firstpage :
1
Lastpage :
4
Abstract :
This paper addresses the task of text independent speaker recognition and in particular looks at capturing time sequence information within the modelling process itself. A recent extension to the popular Gaussian mixture model (GMM) is the segmental mixture model (SMM), and its advantages are thought to be more pronounced as more and more training data becomes available. Here this idea is examined along with a hypothesis on model size, model complexity and their dependencies on the quantity of available training data. Experimental results on a 2000 speaker database show that an SMM does offer better recognition results than a GMM once a threshold in the amount of training data has been reached.
Keywords :
Gaussian processes; computational complexity; mixture models; speaker recognition; GMM; Gaussian mixture model; SMM; model complexity; model size; segmental mixture model; text independent speaker recognition; time sequence information capture; training data; Abstracts; Complexity theory; Speech; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2002 11th European
Conference_Location :
Toulouse
ISSN :
2219-5491
Type :
conf
Filename :
7072196
Link To Document :
بازگشت