مرکز منطقه ای اطلاع رساني علوم و فناوري - Robust speaker verification from GSM-transcoded speech based on decision fusion and feature transformation

DocumentCode :

395305

Title :

Robust speaker verification from GSM-transcoded speech based on decision fusion and feature transformation

Author :

Mak, Man-Wai ; Cheung, Ming-Cheung ; Kung, Sun-Yuan

Volume :

fYear :

2003

fDate :

6-10 April 2003

Abstract :

In speaker verification, a claimant may produce two or more utterances. Typically, the scores of the speech patterns extracted from these utterances are averaged and the resulting mean score is compared with a decision threshold. Rather than simply computing the mean score, we propose to compute the optimal weights for fusing the scores based on the score distribution of the independent utterances and our prior knowledge about the score statistics. More specifically, we use enrollment data to compute the mean scores of client speakers and impostors and consider them to be the prior scores. During verification, we set the fusion weights for individual speech patterns to be a function of the dispersion between the scores of these speech patterns and the prior scores. Experimental results based on the GSM-transcoded speech of 150 speakers from the HTIMIT corpus demonstrate that the proposed fusion algorithm can increase the dispersion between the mean speaker scores and the mean impostor scores. Compared with a baseline approach where equal weights are assigned to all scores, the proposed approach provides a relative error reduction of 19%.

Keywords :

cellular radio; data compression; feature extraction; optimisation; speaker recognition; speech coding; stochastic processes; GSM-transcoded; GSM-transcoded speech; HTIMIT corpus; client speakers; decision fusion; decision threshold; enrollment data; error reduction; feature transformation; fusion algorithm; fusion weights; independent utterances; mean impostor scores; mean speaker scores; optimal weights; robust speaker verification; score distribution; speech patterns; stochastic feature transformation; Aging; Degradation; Distributed computing; Feature extraction; GSM; Robustness; Signal processing; Signal processing algorithms; Speech recognition; Statistical distributions;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on

ISSN :

1520-6149

Print_ISBN :

0-7803-7663-3

Type :

conf

DOI :

10.1109/ICASSP.2003.1202474

Filename :

1202474

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=395305