Title :
Text-dependent speaker recognition by compressed feature-dynamics derived from sinusoidal representation of speech
Author :
Das, Amitava ; Chittaranjan, Gokul ; Srinivasan, V.
Author_Institution :
Microsoft Res. Lab. - India, Bangalore, India
Abstract :
Prevalent speaker recognition methods use only spectral-envelope based features such as MFCC, ignoring the rich speaker identity information contained in the temporal-spectral dynamics of the entire speech signal. We propose a new feature for speaker recognition based on sinusoidal representation of speech called compressed spectral dynamics (Sinogram-CSD), which effectively captures such spectral dynamics and the inherent speaker identity. The discriminative power of CSD allows classification to remain simple. The proposed CSD-MSRI method uses a simple nearest neighbor classifier to deliver performance competitive to conventional MFCC+DTW based text-dependent speaker recognition methods at significantly lower complexity.
Keywords :
computational complexity; feature extraction; signal classification; signal representation; speaker recognition; spectral analysis; compressed feature spectral dynamic; sinogram-CSD-MSRI method; spectral-envelope based feature; speech sinusoidal representation; temporal- spectral dynamics; text-dependent speaker recognition; Complexity theory; Hidden Markov models; Mel frequency cepstral coefficient; Speaker recognition; Spectrogram; Speech; Speech recognition;
Conference_Titel :
Signal Processing Conference, 2008 16th European
Conference_Location :
Lausanne