مرکز منطقه ای اطلاع رساني علوم و فناوري - Frame pruning for automatic speaker identification

DocumentCode :

703260

Title :

Frame pruning for automatic speaker identification

Author :

Besacier, L. ; Bonastre, J.F.

Author_Institution :

LIA/CERI, Avignon, France

fYear :

1998

fDate :

8-11 Sept. 1998

Firstpage :

Lastpage :

Abstract :

In this paper, we propose a frame selection procedure for text-independent speaker identification. Instead of averaging the frame likelihoods along the whole test utterance, some of these are rejected (pruning) and the final score is computed with a limited number of frames. This pruning stage requires a prior frame level likelihood normalization in order to make comparison between frames meaningful. This normalization procedure alone leads to a significative performance enhancement. As far as pruning is concerned, the optimal number of frames pruned is learned on a tuning data set for normal and telephone speech. Validation of the pruning procedure on 567 speakers leads to a significative improvement on TIMIT and NTIMIT (up to 30% error rate reduction on TIMIT).

Keywords :

speaker recognition; NTIMIT; TIMIT; automatic speaker identification; error rate reduction; frame level likelihood normalization; frame pruning stage; frame selection procedure; telephone speech; textindependent speaker identification; utterance testing; Databases; Protocols; Speaker recognition; Speech; Speech processing; Training; Tuning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal Processing Conference (EUSIPCO 1998), 9th European

Conference_Location :

Rhodes

Print_ISBN :

978-960-7620-06-4

Type :

conf

Filename :

7089731

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=703260