Title :
Speaker identification using minimum classification error training
Author :
Siohan, Olivier ; Rosenberg, Aaron E. ; Parthasarathy, Srinivasan
Author_Institution :
AT&T Bell Labs. Res., Florham Park, NJ, USA
Abstract :
We use a minimum classification error (MCE) training paradigm to build a speaker identification system. The training is optimized at the string level for a text-dependent speaker identification task. Experiments performed on a small set speaker identification task show that MCE training can reduce closed-set identification errors by up to 20-25% over a baseline system trained using maximum likelihood estimation. Further experiments suggest that additional improvement can be obtained by using some additional training data from speakers outside the set of registered speakers, leading to an overall reduction of the closed-set identification errors by about 35%
Keywords :
error statistics; hidden Markov models; maximum likelihood estimation; pattern classification; speaker recognition; HMM; MCE training; baseline system; closed-set identification error reduction; experiments; maximum likelihood estimation; minimum classification error training; optimisation; speaker identification; string level; text-dependent speaker identification; training data; Argon; Databases; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Parameter estimation; Speech recognition; Telephony; Training data;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.674379