Title :
Computationally efficient frame-averaged FM feature extraction for speaker recognition
Author :
Thiruvaran, Tharmarajah ; Nosratighods, M. ; Ambikairajah, E. ; Epps, Julien
Author_Institution :
Sch. of Electr. Eng., Univ. of New South Wales, Sydney, NSW
Abstract :
Recently, subband frame-averaged frequency modulation (FM) as a complementary feature to amplitude-based features for several speech based classification problems including speaker recognition has shown promise. One problem with using FM extraction in practical implementations is computational complexity. Proposed is a computationally efficient method to estimate the frame-averaged FM component in a novel manner, using zero crossing counts and the zero crossing counts of the differentiated signal. FM components, extracted from subband speech signals using the proposed method, form a feature vector. Speaker recognition experiments conducted on the NIST 2008 telephone database show that the proposed method successfully augments mel frequency cepstrum coefficients (MFCCs) to improve performance, obtaining 17% relative reductions in equal error rates when compared with an MFCC-based system.
Keywords :
cepstral analysis; computational complexity; feature extraction; frequency modulation; speaker recognition; speech processing; MFCC; NIST 2008 telephone database; amplitude-based feature extraction; computationally efficient method; frame-averaged frequency modulation; mel frequency cepstrum coefficients; speaker recognition; speech signal; zero crossing counts;
Journal_Title :
Electronics Letters
DOI :
10.1049/el.2009.0170