مرکز منطقه ای اطلاع رساني علوم و فناوري - Robust speaker identification using an auditory-based feature

DocumentCode :

2799398

Title :

Robust speaker identification using an auditory-based feature

Author :

Li, Qi ; Huang, Yan

Author_Institution :

Li Creative Technol. (LcT), Inc., Florham Park, NJ, USA

fYear :

2010

fDate :

14-19 March 2010

Firstpage :

4514

Lastpage :

4517

Abstract :

An auditory-based feature extraction algorithm is presented. The feature is based on a recently published time-frequency transform plus a set of modules to simulate the signal processing functions in the cochlea. The feature is applied to a speaker identification task to address the acoustic mismatch problem between training and testing. Usually, the performances of acoustic models trained in clean speech drop significantly when tested on noisy speech. The proposed feature has shown strong robustness in the mismatched situation. As shown in our experiments, in a speaker identification task, both MFCC and the proposed feature have near perfect performances in a clean testing condition, but when the SNR of input signal drops to 6 dB, the average accuracy of the MFCC feature is only 41.2%, while the proposed feature still achieves an average accuracy of 88.3%.

Keywords :

Fourier transforms; acoustic signal processing; cepstral analysis; ear; feature extraction; hearing; speaker recognition; SNR; acoustic mismatch problem; auditory based feature extraction algorithm; mel frequency cepstral coefficient; noisy speech; robust speaker identification; signal processing function; signal-to-noise ratio; time frequency transform; Acoustic signal processing; Acoustic testing; Feature extraction; Loudspeakers; Mel frequency cepstral coefficient; Performance evaluation; Robustness; Signal processing algorithms; Speech; Time frequency analysis; Speech feature extraction; auditory-based feature; cochlea; robust speaker recognition; speaker identification;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on

Conference_Location :

Dallas, TX

ISSN :

1520-6149

Print_ISBN :

978-1-4244-4295-9

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2010.5495589

Filename :

5495589

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2799398