DocumentCode :
395305
Title :
Robust speaker verification from GSM-transcoded speech based on decision fusion and feature transformation
Author :
Mak, Man-Wai ; Cheung, Ming-Cheung ; Kung, Sun-Yuan
Volume :
2
fYear :
2003
fDate :
6-10 April 2003
Abstract :
In speaker verification, a claimant may produce two or more utterances. Typically, the scores of the speech patterns extracted from these utterances are averaged and the resulting mean score is compared with a decision threshold. Rather than simply computing the mean score, we propose to compute the optimal weights for fusing the scores based on the score distribution of the independent utterances and our prior knowledge about the score statistics. More specifically, we use enrollment data to compute the mean scores of client speakers and impostors and consider them to be the prior scores. During verification, we set the fusion weights for individual speech patterns to be a function of the dispersion between the scores of these speech patterns and the prior scores. Experimental results based on the GSM-transcoded speech of 150 speakers from the HTIMIT corpus demonstrate that the proposed fusion algorithm can increase the dispersion between the mean speaker scores and the mean impostor scores. Compared with a baseline approach where equal weights are assigned to all scores, the proposed approach provides a relative error reduction of 19%.
Keywords :
cellular radio; data compression; feature extraction; optimisation; speaker recognition; speech coding; stochastic processes; GSM-transcoded; GSM-transcoded speech; HTIMIT corpus; client speakers; decision fusion; decision threshold; enrollment data; error reduction; feature transformation; fusion algorithm; fusion weights; independent utterances; mean impostor scores; mean speaker scores; optimal weights; robust speaker verification; score distribution; speech patterns; stochastic feature transformation; Aging; Degradation; Distributed computing; Feature extraction; GSM; Robustness; Signal processing; Signal processing algorithms; Speech recognition; Statistical distributions;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1202474
Filename :
1202474
Link To Document :
بازگشت