DocumentCode :
2874929
Title :
Text independent speaker identification using automatic acoustic segmentation
Author :
Rose, Richard C. ; Reynolds, Douglas A.
Author_Institution :
Lincoln Lab., MIT, Lexington, MA, USA
fYear :
1990
fDate :
3-6 Apr 1990
Firstpage :
293
Abstract :
An acoustic-class-dependent technique for text-independent speaker identification on very short utterances is described. The technique is based on maximum-likelihood estimation of a Gaussian mixture model representation of speaker identity. Gaussian mixtures are noted for their robustness as a parametric model and their ability to form smooth estimates of rather arbitrary underlying densities. Speaker model parameters are estimated using a special case of the iterative expectation-maximization (EM) algorithm, and a number of techniques are investigated for improving model robustness. The system is evaluated using a 12 reference speaker population from a conversational speech database. It achieves 80% average text-independent speaker identification performance for a 1-s test utterance length
Keywords :
iterative methods; parameter estimation; speech recognition; Gaussian mixture model representation; acoustic-class-dependent technique; automatic acoustic segmentation; conversational speech database; iterative expectation-maximization; maximum-likelihood estimation; parametric model; robustness; smooth estimates; speaker identity; speaker population; text-independent speaker identification; utterance length; very short utterances; Databases; Gaussian distribution; Iterative algorithms; Laboratories; Loudspeakers; Maximum likelihood estimation; Parameter estimation; Parametric statistics; Robustness; Speech analysis; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
Conference_Location :
Albuquerque, NM
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.1990.115638
Filename :
115638
Link To Document :
بازگشت