• DocumentCode
    590716
  • Title

    An investigation into better frequency warping for time-varying speaker recognition

  • Author

    Linlin Wang ; Xiaojun Wu ; Thomas Fang Zheng ; Chenhao Zhang

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
  • fYear
    2012
  • fDate
    3-6 Dec. 2012
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Performance degradation has been observed in the presence of time intervals in practical speaker recognition systems. Researchers usually resort to enrollment data augmentation, speaker model adaptation, and variable verification thresholds to alleviate the time-varying impact. This paper instead works in the feature domain and investigates better frequency warping for the target task. Two methods to determine the discrimination sensitivity of frequency bands are explored: an energy-based F-ratio measure and a performance-driven one. Frequency warping is then performed according to the discrimination sensitivity curves over the whole frequency range. Experimental results show that the proposed features outperform both MFCCs and LFCCs and, to some extent, alleviate the time-varying impact on speaker recognition.
  • Keywords
    feature extraction; speaker recognition; discrimination sensitivity curves; energy-based F-ratio measure; enrollment data augmentation; feature domain; frequency warping; speaker model adaptation; time-varying speaker recognition; variable verification threshold; Frequency conversion; Sensitivity; Speaker recognition; Spectrogram; Speech recognition; Time frequency analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
  • Conference_Location
    Hollywood, CA
  • Print_ISBN
    978-1-4673-4863-8
  • Type

    conf

  • Filename
    6411863