DocumentCode :
3528880
Title :
Evaluation of a fused FM and cepstral-based speaker recognition system on the NIST 2008 SRE
Author :
Nosratighods, Mohaddeseh ; Thiruvaran, Tharmarajah ; Epps, Julien ; Ambikairajah, Eliathamby ; Ma, Bin ; Li, Haizhou
Author_Institution :
Sch. of Electr. Eng. & Telecommun., Univ. of New South Wales, Sydney, NSW
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4233
Lastpage :
4236
Abstract :
This paper reports the fusion of two speaker recognition subsystems, one based on frequency modulation (FM) features and the other on MFCC features. The motivation for their fusion was to improve recognition accuracy across different types of channel variation, since the two features are believed to contain complementary information. It was found that the MFCC-based subsystem outperformed the FM-based subsystem on telephone conversations from the NIST SRE-06 dataset, while the opposite was true for the NIST SRE-08 telephone data. As a result, the FM-based subsystem performed as well as the MFCC-based subsystem, and their fusion gave up to 23% relative improvement in terms of EER over the MFCC subsystem alone when evaluated on the NIST 2008 core condition.
Keywords :
cepstral analysis; frequency modulation; speaker recognition; EER; MFCC features; MFCC subsystem; NIST SRE-06 dataset; NIST SRE-08 telephone data; cepstral-based speaker recognition system; channel variations; telephone conversations; Australia; Frequency estimation; Humans; Mel frequency cepstral coefficient; NIST; Psychoacoustic models; Resonance; Speech; Fusion; MFCC
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960563
Filename :
4960563