Title :
Integration of Complementary Acoustic Features for Speaker Recognition
Author :
Zheng, Nengheng ; Lee, Tan ; Ching, P.C.
Author_Institution :
Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Shatin
fDate :
3/1/2007 12:00:00 AM
Abstract :
This letter describes a speaker verification system that uses complementary acoustic features derived from the vocal source excitation and the vocal tract system. A new feature set, named the wavelet octave coefficients of residues (WOCOR), is proposed to capture the spectro-temporal source excitation characteristics embedded in the linear predictive residual signal. WOCOR is used to supplement the conventional vocal tract-related features, in this case, the Mel-frequency cepstral coefficients (MFCC), for speaker verification. A novel confidence measure-based score fusion technique is applied to integrate WOCOR and MFCC. Speaker verification experiments are carried out on the NIST 2001 database. The equal error rate (EER) attained with the proposed method is 7.67%, in comparison to 9.30% of the conventional MFCC-based system
Keywords :
acoustic signal processing; cepstral analysis; feature extraction; sensor fusion; speaker recognition; speech processing; wavelet transforms; MFCC; Mel-frequency cepstral coefficient; WOCOR; complementary acoustic feature; confidence measure; linear predictive residual signal; score fusion technique; speaker recognition; speaker verification system; spectro-temporal source excitation; vocal source excitation; vocal tract system; wavelet octave coefficients of residue; Cepstral analysis; Fourier transforms; Loudspeakers; Mel frequency cepstral coefficient; Power harmonic filters; Signal generators; Speaker recognition; Speech; Time frequency analysis; Wavelet transforms; Confidence measure; LP residual signal; information fusion; speaker verification; wavelet transform;
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2006.884031