DocumentCode :
2492737
Title :
Effect of MFCC normalization on vector quantization based speaker identification
Author :
Shirali-Shahreza, M. Hassan ; Shirali-Shahreza, Sajad
Author_Institution :
Virtual Educ. Grad. Coll., Amirkabir Univ. of Technol., Tehran, Iran
fYear :
2010
fDate :
15-18 Dec. 2010
Firstpage :
250
Lastpage :
253
Abstract :
Mel Frequency Cepstral Coefficients (MFCC) are widely used in speech recognition and speaker identification. MFCC features are usually pre-processed before being used for recognition. One of these pre-processing is creating delta and delta-delta coefficients and append them to MFCC to create feature vector. Another pre-processing is coefficients mean normalization. In this paper, the effect of these two processes on the accuracy of a Vector Quantization (VQ) speaker identification system is compared. Additionally, it is shown that coefficient variance normalization, which is less common, can improve the accuracy.
Keywords :
cepstral analysis; speaker recognition; vector quantisation; MFCC; delta coefficients; delta-delta coefficients; mean normalization; mel frequency cepstral coefficients; speaker identification; speech recognition; vector quantization; Accuracy; Databases; Feature extraction; Mel frequency cepstral coefficient; Speech; Speech recognition; Vector quantization; Mel Frequency Cepstral Coefficients (MFCC); Normalization; Speaker Recognition; Vector Quantization (VQ);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Information Technology (ISSPIT), 2010 IEEE International Symposium on
Conference_Location :
Luxor
Print_ISBN :
978-1-4244-9992-2
Type :
conf
DOI :
10.1109/ISSPIT.2010.5711789
Filename :
5711789
Link To Document :
بازگشت