مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

323773

Title :

Frame pruning for speaker recognition

Author :

Besacier, L. ; Bonastre, J.F.

Author_Institution :

LIA/CERI, Avignon, France

Volume :

fYear :

1998

fDate :

12-15 May 1998

Firstpage :

765

Abstract :

In this paper, we propose a frame selection procedure for text-independent speaker identification. Instead of averaging the frame likelihoods along the whole test utterance, some of these are rejected (pruning) and the final score is computed with a limited number of frames. This pruning stage requires a prior frame level likelihood normalization in order to make comparison between frames meaningful. This normalization procedure alone leads to a significant performance enhancement. As far as pruning is concerned, the optimal number of frames pruned is learned on a tuning data set for normal and telephone speech. Validation of the pruning procedure on 567 speakers leads to a 27% identification rate improvement on TIMIT, and to 17% on NTIMIT

Keywords :

speaker recognition; NTIMIT database; TIMIT database; frame level likelihood normalization; frame pruning; frame selection procedure; normal speech; performance enhancement; speaker recognition; telephone speech; text-independent speaker identification; tuning data set; Arithmetic; Databases; Frequency; Loudspeakers; Noise robustness; Signal processing; Speaker recognition; Speech analysis; System testing; Telephony;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on

Conference_Location :

Seattle, WA

ISSN :

1520-6149

Print_ISBN :

0-7803-4428-6

Type :

conf

DOI :

10.1109/ICASSP.1998.675377

Filename :

675377

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=323773