Title :
Robust Q Features for Speaker Identification
Author :
Deshpande, Mangesh S. ; Holambe, Raghunath S.
Author_Institution :
Dept. of Instrum. Eng., SGGS Inst. of Eng. & Technol., Nanded, India
Abstract :
In this paper, a nonlinear AM-FM speech model is used to extract robust features for speaker identification. The proposed features measure the amount of amplitude and frequency modulation that the commonly used linear source-filter model and the Mel frequency cepstral coefficients (MFCC) feature fails to capture. From the short time estimates of the frequency and bandwidth, a novel set of features is proposed. The robustness and discriminability of the features is investigated in comparison with the MFCC features using the clean speech corpus from TIMIT database and noise from the NOISEX-92 database. The proposed feature set provides significant improvement in the identification accuracy over the MFCC features under mismatched training and testing environments. The results show that better speaker identification rates are attainable under mismatched conditions especially at low signal-to-noise ratio (SNR).
Keywords :
amplitude modulation; cepstral analysis; feature extraction; frequency modulation; speaker recognition; Mel frequency cepstral coefficients feature; NOISEX-92 database; Q feature; SNR; TIMIT database; amplitude modulation; feature extraction; frequency modulation; linear source-filter model; nonlinear AM-FM speech model; signal-to-noise ratio; speaker identification; Feature extraction; Frequency estimation; Frequency measurement; Frequency modulation; Mel frequency cepstral coefficient; Robustness; Signal to noise ratio; Spatial databases; Speech; Working environment noise; AM-FM model; GMM; MFCC; speaker identification;
Conference_Titel :
Advances in Recent Technologies in Communication and Computing, 2009. ARTCom '09. International Conference on
Conference_Location :
Kottayam, Kerala
Print_ISBN :
978-1-4244-5104-3
Electronic_ISBN :
978-0-7695-3845-7
DOI :
10.1109/ARTCom.2009.75