Interfusing the Confused Region Score of Speaker Verification Systems

Author

Long, Yanhua ; Guo, Wu ; Dai, Lirong

Author_Institution

Dept. of Electron. Eng. & Inf. Sci., Univ. of Sci. & Technol. of China, Hefei, China

fYear

2008

fDate

16-19 Dec. 2008

Firstpage

1

Lastpage

4

Abstract

In text-independent speaker recognition, many excellent techniques are based on cepstral acoustic features. Recently, high-level prosodic features have been widely used to verify a speaker's identity, as they are less sensitive to channel and noise effects. However, how to combine the scores of an existing prosodic system with those of an acoustic-feature-based system to achieve superior performance remains a difficult issue. This paper presents a combination method called interfusing the confused region scores (ICRS) to achieve better recognition precision. We report results on the NIST 2006 speaker recognition evaluation (SRE) using two component systems, a standard MFCC-SVM system and a PGCP-SVM prosodic system, and show that the proposed interfusing technique yields a 9.25% reduction in equal error rate (EER), reaching an EER of 4.9% after system combination.

Keywords

speech recognition; cepstral acoustic features; equal error rate; recognition precision; speaker recognition evaluation; speaker verification systems; text-independent speaker recognition; Acoustic noise; Acoustical engineering; Cepstral analysis; Information science; Loudspeakers; NIST; Noise level; Speaker recognition; Speech; Testing

fLanguage

English

Publisher

IEEE

Conference_Titel

Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on

Conference_Location

Kunming

Print_ISBN

978-1-4244-2942-4

Electronic_ISBN

978-1-4244-2943-1

Type

conf

DOI

10.1109/CHINSL.2008.ECP.90

Filename

4730344