Title :
Survey and evaluation of acoustic features for speaker recognition
Author :
Lawson, A. ; Vabishchevich, P. ; Huggins, M. ; Ardis, P. ; Battles, B. ; Stauffer, A.
Author_Institution :
RADC, Inc., Rome, NY, USA
Abstract :
This study seeks to quantify the effectiveness of a broad range of acoustic features for speaker identification and their impact in feature fusion. Sixteen different acoustic features are evaluated under nine different acoustic, channel and speaking style conditions. Three major types of features are examined: traditional (MFCC, PLP, LPCC, etc.), innovative (PYKFEC, MVDR, etc.) and extensions of these (frequency-constrained LPCC, LFCC). All features were then fused in binary and three-way fusion to determine the complementarity between features and their impact on accuracy. Results were surprising, with the MVDR feature having the highest performance for any single feature, and LPCC based features having the greatest impact on fusion effectiveness. Commonly used features like PLP and MFCC did not achieve the best results in any category. It was further found that removing the perceptually-motivated warping from MFCC, MVDR and PYKFEC improved the performance of these features significantly.
Keywords :
speaker recognition; LPCC; MFCC; MVDR; PLP; PYKFEC; acoustic features; fusion effectiveness; perceptually-motivated warping; speaker recognition; Accuracy; Feature extraction; Filter banks; Mel frequency cepstral coefficient; Noise measurement; Speaker recognition; acoustic features; feature fusion; speaker recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947590