DocumentCode
590716
Title
An investigation into better frequency warping for time-varying speaker recognition
Author
Linlin Wang ; Xiaojun Wu ; Thomas Fang Zheng ; Chenhao Zhang
Author_Institution
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
fYear
2012
fDate
3-6 Dec. 2012
Firstpage
1
Lastpage
4
Abstract
Performance degradation has been observed in practical speaker recognition systems in the presence of time intervals. Researchers usually resort to enrollment data augmentation, speaker model adaptation, and variable verification thresholds to alleviate this time-varying impact. This paper instead works in the feature domain and investigates better frequency warping for the target task. Two methods for determining the discrimination sensitivity of frequency bands are explored: an energy-based F-ratio measure and a performance-driven one. Frequency warping is then performed according to the discrimination sensitivity curves over the whole frequency range. Experimental results show that the proposed features outperform both MFCCs and LFCCs and, to some extent, alleviate the time-varying impact on speaker recognition.
Keywords
feature extraction; speaker recognition; discrimination sensitivity curves; energy-based F-ratio measure; enrollment data augmentation; feature domain; frequency warping; speaker model adaptation; time-varying speaker recognition; variable verification threshold; Frequency conversion; Sensitivity; Speaker recognition; Spectrogram; Speech recognition; Time frequency analysis
fLanguage
English
Publisher
IEEE
Conference_Titel
Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
Conference_Location
Hollywood, CA
Print_ISBN
978-1-4673-4863-8
Type
conf
Filename
6411863