DocumentCode :
3488372
Title :
Feature and score normalization for speaker verification of cellular data
Author :
Barras, Claude ; Gauvain, Jean-Luc
Author_Institution :
Spoken Language Process. Group, LIMSI-CNRS, Orsay, France
Volume :
2
fYear :
2003
fDate :
6-10 April 2003
Abstract :
This paper presents some experiments with feature and score normalization for text-independent speaker verification of cellular data. The speaker verification system is based on cepstral features and Gaussian mixture models with 1024 components. The following methods, which have been proposed for feature and score normalization, are reviewed and evaluated on cellular data: cepstral mean subtraction (CMS), variance normalization, feature warping, T-norm, Z-norm and the cohort method. We found that the combination of feature warping and T-norm gives the best results on the NIST 2002 test data (for the one-speaker detection task). Compared to a baseline system using both CMS and variance normalization and achieving a 0.410 minimal decision cost function (DCF), feature warping and T-norm respectively bring 8% and 12% relative reductions, whereas the combination of both techniques yields a 22% relative reduction, reaching a DCF of 0.320. This result approaches the state-of-the-art performance level obtained for speaker verification with land-line telephone speech.
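The normalizations named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`cmvn`, `t_norm`, `relative_reduction`), the use of the population standard deviation, and the zero-variance guard are assumptions made for illustration. It shows per-coefficient cepstral mean subtraction with variance normalization, T-norm scaling of a target score by impostor-model scores for the same test segment, and the relative-reduction arithmetic behind the reported figures.

```python
from statistics import mean, pstdev


def cmvn(frames):
    """Cepstral mean subtraction plus variance normalization, per coefficient.

    frames: list of equal-length lists, one cepstral vector per frame.
    """
    dims = list(zip(*frames))                  # values grouped by coefficient
    mus = [mean(d) for d in dims]
    # Assumption: fall back to 1.0 when a coefficient has zero variance.
    sigmas = [pstdev(d) or 1.0 for d in dims]
    return [[(x - m) / s for x, m, s in zip(f, mus, sigmas)] for f in frames]


def t_norm(target_score, impostor_scores):
    """T-norm: normalize the target-model score with the mean and standard
    deviation of the same test segment's scores against impostor models."""
    mu = mean(impostor_scores)
    sigma = pstdev(impostor_scores)            # assumption: population std
    return (target_score - mu) / sigma


def relative_reduction(baseline, new):
    """Relative reduction of a cost such as the minimal DCF."""
    return (baseline - new) / baseline


# DCF values quoted in the abstract: 0.410 baseline, 0.320 combined system.
print(round(relative_reduction(0.410, 0.320), 2))
```

With the two DCF values from the abstract, the last line reproduces the stated relative reduction of about 22%.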
Keywords :
Gaussian processes; cellular radio; feature extraction; speaker recognition; Gaussian mixture models; NIST 2002 test data; T-norm method; Z-norm method; baseline system; cellular data; cepstral features; cepstral mean subtraction; cohort method; feature normalization; feature warping; land-line telephone speech; minimal decision cost function; one-speaker detection task; score normalization; speaker verification; text-independent speaker verification system; variance normalization; Cepstral analysis; Collision mitigation; Cost function; Loudspeakers; NIST; Natural languages; Speech analysis; Telephony; Testing; Training data;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1202291
Filename :
1202291