DocumentCode
2799398
Title
Robust speaker identification using an auditory-based feature
Author
Li, Qi ; Huang, Yan
Author_Institution
Li Creative Technol. (LcT), Inc., Florham Park, NJ, USA
fYear
2010
fDate
14-19 March 2010
Firstpage
4514
Lastpage
4517
Abstract
An auditory-based feature extraction algorithm is presented. The feature is based on a recently published time-frequency transform plus a set of modules to simulate the signal processing functions in the cochlea. The feature is applied to a speaker identification task to address the acoustic mismatch problem between training and testing. Usually, the performances of acoustic models trained in clean speech drop significantly when tested on noisy speech. The proposed feature has shown strong robustness in the mismatched situation. As shown in our experiments, in a speaker identification task, both MFCC and the proposed feature have near perfect performances in a clean testing condition, but when the SNR of input signal drops to 6 dB, the average accuracy of the MFCC feature is only 41.2%, while the proposed feature still achieves an average accuracy of 88.3%.
Keywords
Fourier transforms; acoustic signal processing; cepstral analysis; ear; feature extraction; hearing; speaker recognition; SNR; acoustic mismatch problem; auditory based feature extraction algorithm; mel frequency cepstral coefficient; noisy speech; robust speaker identification; signal processing function; signal-to-noise ratio; time frequency transform; Acoustic signal processing; Acoustic testing; Feature extraction; Loudspeakers; Mel frequency cepstral coefficient; Performance evaluation; Robustness; Signal processing algorithms; Speech; Time frequency analysis; Speech feature extraction; auditory-based feature; cochlea; robust speaker recognition; speaker identification;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location
Dallas, TX
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2010.5495589
Filename
5495589
Link To Document