Title :
Effect of feature warping and decorrelation on Mel Filterbank Slope for speaker recognition
Author :
Madikeri, Srikanth ; Murthy, Hema A.
Author_Institution :
Dept. of Computer Sci. & Eng., Indian Inst. of Technol. Madras, Chennai, India
Abstract :
The Mel Filterbank Slope (MFS) feature has been shown to consistently perform better than the conventional Mel Frequency Cepstral Coefficients (MFCC) for speaker recognition. In this work, the issues with respect to the feature's robustness to intersession variability and its large dimensionality are addressed. Short-term feature warping is used to improve the robustness of MFS. This is observed to give an absolute improvement of 1% in EER on the NIST 2003 SRE benchmark dataset. Dimensionality reduction on raw MFS features is performed using the Discrete Cosine Transform (DCT). Efficient reduction is obtained using DCT with no deterioration in performance. Feature warping along with DCT is observed to give an absolute improvement of 2% in EER. An overall performance improvement of 3.3% is shown when the feature is fused with temporal information from MFCC.
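The two processing steps named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no window length, no number of retained DCT coefficients, and no ordering of the two steps, so the names `dct_reduce` and `warp_features`, the parameters `keep` and `window`, and the choice to apply the DCT before warping are all assumptions made for illustration. The warping follows the commonly used rank-based scheme, in which each value is replaced by the standard normal quantile of its rank within a sliding window of frames.

```python
import math
from statistics import NormalDist


def dct_reduce(vec, keep):
    """Orthonormal DCT-II of one feature vector, keeping the first `keep` coefficients.

    `keep` is a hypothetical parameter; the abstract does not state how many
    coefficients are retained.
    """
    n = len(vec)
    out = []
    for k in range(keep):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        total = sum(v * math.cos(math.pi * (i + 0.5) * k / n) for i, v in enumerate(vec))
        out.append(scale * total)
    return out


def warp_features(frames, window=301):
    """Short-term feature warping over a sliding window of frames.

    Each dimension is warped independently: the value in the current frame is
    ranked among the values in the window centred on it, and the rank is mapped
    to the corresponding standard normal quantile. The default `window` of 301
    frames is an assumed value, not one taken from the paper.
    """
    n = len(frames)
    dim = len(frames[0])
    half = window // 2
    norm = NormalDist()  # target distribution: zero mean, unit variance
    warped = [[0.0] * dim for _ in range(n)]
    for t in range(n):
        lo, hi = max(0, t - half), min(n, t + half + 1)  # window is truncated at the edges
        size = hi - lo
        for d in range(dim):
            x = frames[t][d]
            # Ascending rank of the current value within the window (1 = smallest).
            rank = 1 + sum(1 for u in range(lo, hi) if frames[u][d] < x)
            warped[t][d] = norm.inv_cdf((rank - 0.5) / size)
    return warped


def process(frames, keep, window=301):
    """Assumed pipeline order: decorrelate and reduce with the DCT, then warp."""
    reduced = [dct_reduce(f, keep) for f in frames]
    return warp_features(reduced, window)
```

Here `frames` stands for a list of per-frame feature vectors (for example, raw MFS vectors computed elsewhere); computing the MFS feature itself is outside the scope of this sketch.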
Keywords :
discrete cosine transforms; speaker recognition; benchmark dataset; decorrelation; discrete cosine transform; feature warping; intersession variability; mel filterbank slope; robustness; Feature extraction; Mel frequency cepstral coefficient; Speech; Vectors; channel compensation
Conference_Title :
Signal Processing and Communications (SPCOM), 2012 International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4673-2013-9
DOI :
10.1109/SPCOM.2012.6290222