Text-dependent speaker recognition by compressed feature-dynamics derived from sinusoidal representation of speech

Author

Das, Amitava ; Chittaranjan, Gokul ; Srinivasan, V.

Author_Institution

Microsoft Res. Lab. - India, Bangalore, India

fYear

2008

fDate

25-29 Aug. 2008

Firstpage

1

Lastpage

5

Abstract

Prevalent speaker recognition methods use only spectral-envelope based features such as MFCC, ignoring the rich speaker identity information contained in the temporal-spectral dynamics of the entire speech signal. We propose a new feature for speaker recognition based on sinusoidal representation of speech called compressed spectral dynamics (Sinogram-CSD), which effectively captures such spectral dynamics and the inherent speaker identity. The discriminative power of CSD allows classification to remain simple. The proposed CSD-MSRI method uses a simple nearest neighbor classifier to deliver performance competitive to conventional MFCC+DTW based text-dependent speaker recognition methods at significantly lower complexity.

Keywords

computational complexity; feature extraction; signal classification; signal representation; speaker recognition; spectral analysis; compressed feature spectral dynamic; sinogram-CSD-MSRI method; spectral-envelope based feature; speech sinusoidal representation; temporal- spectral dynamics; text-dependent speaker recognition; Complexity theory; Hidden Markov models; Mel frequency cepstral coefficient; Speaker recognition; Spectrogram; Speech; Speech recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Signal Processing Conference, 2008 16th European

Conference_Location

Lausanne

ISSN

2219-5491

Type

conf

Filename

7080761