DocumentCode
2066955
Title
Interfusing the Confused Region Score of Speaker Verification Systems
Author
Long, Yanhua ; Guo, Wu ; Dai, Lirong
Author_Institution
Dept. of Electron. Eng. & Inf. Sci., Univ. of Sci. & Technol. of China, Hefei, China
fYear
2008
fDate
16-19 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
In the text-independent speaker recognition field, there have been many excellent techniques based on the cepstral acoustic features. Recently, high level prosodic features have been widely used to verify the speaker´s identity as they are less sensitive to the channel and noisy effect. But how to combine the existed prosodic system´s scores with the scores of the system based on acoustic features to achieve a superior performance becomes a very difficult issue. This paper presents a combination method called interfusing the confused region scores (ICRS) to achieve a better recognition precisio.n. We report results on the NIST 2006 speaker recognition evaluation (SRE) using two component systems: a standard MFCC-SVM and PGCP- SVM prosodic system, and show that the proposed interfusing technique results in 9.25% reduction in equal error rate (EER) and at last gets a EER = 4.9% after system´s combination.
Keywords
speech recognition; cepstral acoustic features; equal error rate; recognition precision; speaker recognition evaluation; speaker verification systems; text-independent speaker recognition; Acoustic noise; Acoustical engineering; Cepstral analysis; Information science; Loudspeakers; NIST; Noise level; Speaker recognition; Speech; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
Conference_Location
Kunming
Print_ISBN
978-1-4244-2942-4
Electronic_ISBN
978-1-4244-2943-1
Type
conf
DOI
10.1109/CHINSL.2008.ECP.90
Filename
4730344
Link To Document