DocumentCode
2179775
Title
Survey and evaluation of acoustic features for speaker recognition
Author
Lawson, A. ; Vabishchevich, P. ; Huggins, M. ; Ardis, P. ; Battles, B. ; Stauffer, A.
Author_Institution
RADC, Inc., Rome, NY, USA
fYear
2011
fDate
22-27 May 2011
Firstpage
5444
Lastpage
5447
Abstract
This study seeks to quantify the effectiveness of a broad range of acoustic features for speaker identification and their impact in feature fusion. Sixteen different acoustic features are evaluated under nine different acoustic, channel and speaking style conditions. Three major types of features are examined: traditional (MFCC, PLP, LPCC, etc.), innovative (PYKFEC, MVDR, etc.) and extensions of these (frequency-constrained LPCC, LFCC). All features were then fused in binary and three-way fusion to determine the complementarity between features and their impact on accuracy. Results were surprising, with the MVDR feature having the highest performance for any single feature, and LPCC based features having the greatest impact on fusion effectiveness. Commonly used features like PLP and MFCC did not achieve the best results in any category. It was further found that removing the perceptually-motivated warping from MFCC, MVDR and PYKFEC improved the performance of these features significantly.
Keywords
speaker recognition; LPCC; MFCC; MVDR; PLP; PYKFEC; acoustic features; fusion effectiveness; perceptually-motivated warping; speaker recognition; Accuracy; Feature extraction; Filter banks; Mel frequency cepstral coefficient; Noise measurement; Speaker recognition; acoustic features; feature fusion; speaker recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947590
Filename
5947590
Link To Document