DocumentCode :
2179788
Title :
Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions
Author :
Sadjadi, Seyed Omid ; Hansen, John H L
Author_Institution :
Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
5448
Lastpage :
5451
Abstract :
It is well known that MFCC based speaker identification (SID) systems easily break down under mismatched training and test conditions. One such mismatch occurs when a SID system is trained on anechoic speech data, while test is carried out using reverberant data collected via a distant microphone. In this study, a new set of feature parameters based on the Hilbert envelope of Gammatone filterbank outputs is proposed to improve SID performance in the presence of room reverberation. Considering two distinct perceptual effects of reverberation on speech signals, i.e., coloration and long-term reverberation, two different compensation strategies are integrated within the feature extraction framework to effectively suppress the effects of reverberation. Experimental evaluation is performed using speech material from the TIMIT, four different measured room impulse responses (RIR) from Aachen impulse response (AIR) database, and a GMM-based SID system. Obtained results indicate significant improvement over the baseline system with MFCCs plus cepstral mean subtraction (CMS), confirming the effectiveness of the proposed feature parameters for SID under reverberant mismatched conditions.
Keywords :
Hilbert transforms; speaker recognition; AIR database; Aachen impulse response database; CMS; Gammatone filterbank; Hilbert envelope based features; MFCC; RIR; SID system; TIMIT; anechoic speech data; reverberant mismatched conditions; robust speaker identification; room impulse response; Accuracy; Feature extraction; Microphones; Reverberation; Robustness; Speech; Gammatone filterbank; Hilbert envelope; mismatched conditions; reverberation suppression; speaker identification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947591
Filename :
5947591
Link To Document :
بازگشت