• DocumentCode
    700229
  • Title

    Text-dependent speaker recognition by compressed feature-dynamics derived from sinusoidal representation of speech

  • Author

    Das, Amitava ; Chittaranjan, Gokul ; Srinivasan, V.

  • Author_Institution
    Microsoft Res. Lab. - India, Bangalore, India
  • fYear
    2008
  • fDate
    25-29 Aug. 2008
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Prevalent speaker recognition methods use only spectral-envelope based features such as MFCC, ignoring the rich speaker identity information contained in the temporal-spectral dynamics of the entire speech signal. We propose a new feature for speaker recognition based on sinusoidal representation of speech called compressed spectral dynamics (Sinogram-CSD), which effectively captures such spectral dynamics and the inherent speaker identity. The discriminative power of CSD allows classification to remain simple. The proposed CSD-MSRI method uses a simple nearest neighbor classifier to deliver performance competitive to conventional MFCC+DTW based text-dependent speaker recognition methods at significantly lower complexity.
  • Keywords
    computational complexity; feature extraction; signal classification; signal representation; speaker recognition; spectral analysis; compressed feature spectral dynamic; sinogram-CSD-MSRI method; spectral-envelope based feature; speech sinusoidal representation; temporal- spectral dynamics; text-dependent speaker recognition; Complexity theory; Hidden Markov models; Mel frequency cepstral coefficient; Speaker recognition; Spectrogram; Speech; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2008 16th European
  • Conference_Location
    Lausanne
  • ISSN
    2219-5491
  • Type

    conf

  • Filename
    7080761