DocumentCode
3314715
Title
Using information theoretic vector quantization for inverted MFCC based speaker verification
Author
Memon, Sheeraz ; Lech, Margaret ; He, Ling
Author_Institution
Sch. of Electr. & Comput. Eng., RMIT Univ., Melbourne, VIC
fYear
2009
fDate
17-18 Feb. 2009
Firstpage
1
Lastpage
5
Abstract
Over the recent years different versions the GMM classifier combined with the MFCC features have been established as speaker verification benchmarks. Although highly efficient, these systems suffer from computational complexity and occasional convergence problems. In this study a search of alternative classification and feature extraction methods of similar classification efficiency but overcoming some of the problems of the classical methods was undertaken. Preliminary results obtained for two different classification methods: the classical GMM and the ITVQ and three different feature extraction methods: MFCC, IMFCC and the MFCC/IMFCC fusion are presented. The ITVQ did not show better results compare to the classical GMM classifier, however the EER increase in case for the ITVQ was only by 0.2%. The best feature extraction method was proven to be the MFCC/IMFCC fusion. Both the MFCC/IMFCC fusion and the IMFCC outperformed the classical MFCC method.
Keywords
benchmark testing; computational complexity; feature extraction; information theory; speaker recognition; GMM classifier; ITVQ; MFCC-IMFCC fusion; Mel-frequency cepstrum coefficients; computational complexity; feature extraction methods; information theoretic vector quantization; inverted MFCC; occasional convergence problems; speaker verification; speaker verification benchmarks; Computational complexity; Data mining; Feature extraction; Helium; Humans; Loudspeakers; Mel frequency cepstral coefficient; Speaker recognition; Speech; Vector quantization; IMFCC; ITVQ; Information Theory; MFCC;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer, Control and Communication, 2009. IC4 2009. 2nd International Conference on
Conference_Location
Karachi
Print_ISBN
978-1-4244-3313-1
Electronic_ISBN
978-1-4244-3314-8
Type
conf
DOI
10.1109/IC4.2009.4909212
Filename
4909212
Link To Document