DocumentCode :
700229
Title :
Text-dependent speaker recognition by compressed feature-dynamics derived from sinusoidal representation of speech
Author :
Das, Amitava ; Chittaranjan, Gokul ; Srinivasan, V.
Author_Institution :
Microsoft Res. Lab. - India, Bangalore, India
fYear :
2008
fDate :
25-29 Aug. 2008
Firstpage :
1
Lastpage :
5
Abstract :
Prevalent speaker recognition methods use only spectral-envelope based features such as MFCC, ignoring the rich speaker identity information contained in the temporal-spectral dynamics of the entire speech signal. We propose a new feature for speaker recognition based on sinusoidal representation of speech called compressed spectral dynamics (Sinogram-CSD), which effectively captures such spectral dynamics and the inherent speaker identity. The discriminative power of CSD allows classification to remain simple. The proposed CSD-MSRI method uses a simple nearest neighbor classifier to deliver performance competitive to conventional MFCC+DTW based text-dependent speaker recognition methods at significantly lower complexity.
Keywords :
computational complexity; feature extraction; signal classification; signal representation; speaker recognition; spectral analysis; compressed feature spectral dynamic; sinogram-CSD-MSRI method; spectral-envelope based feature; speech sinusoidal representation; temporal- spectral dynamics; text-dependent speaker recognition; Complexity theory; Hidden Markov models; Mel frequency cepstral coefficient; Speaker recognition; Spectrogram; Speech; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2008 16th European
Conference_Location :
Lausanne
ISSN :
2219-5491
Type :
conf
Filename :
7080761
Link To Document :
بازگشت