DocumentCode :
1716747
Title :
On text-independent speaker recognition via improved Vector Quantization method
Author :
Liu Ting-ting ; Guan Sheng-xiao
Author_Institution :
Dept. of Autom., Univ. of Sci. & Technol. of China, Hefei, China
fYear :
2013
Firstpage :
3912
Lastpage :
3916
Abstract :
The text-independent speaker recognition system is mainly constituted of three functional modules including speech pretreatment, the feature parameter extraction, and pattern matching judgment. The paper uses the MATLAB software to acquire a design of the system. The feature parameters utilized in this paper are Mel-Frequency Cepstrum Coefficients (MFCC) and their first-order differential characteristics. With the help of the Fisher criterion the number of the dimension of the feature parameters is decreased. Vector Quantization (VQ) model is applied to devise the optimal codebook. The paper suggests some modifications in order to improve the efficiency of the algorithm on the basis of high recognition rate: Fisher ratio of each dimensional parameter is used as weighing coefficient at the distance measurement; an approach to speeding up the search is proposed; Process the empty cell in the procedure of codebook formation. In addition, it discusses a few factors including the training and testing time, the dimension of the codebook, stopping and acceptance thresholds, which have an impact on identification accuracy rate by experimentation.
Keywords :
cepstral analysis; differential equations; distance measurement; feature extraction; pattern matching; speaker recognition; speech coding; speech synthesis; vector quantisation; Fisher criterion; Fisher ratio; MATLAB software; MFCC; Mel-frequency cepstrum coefficients; VQ model; acceptance thresholds; codebook dimension; codebook formation; distance measurement; feature parameter extraction; feature parameters; first-order differential characteristics; high recognition rate; identification accuracy rate; optimal codebook; pattern matching judgment; speech pretreatment; stopping thresholds; system design; text-independent speaker recognition system; three functional modules; vector quantization model; weighing coefficient; Feature extraction; Mel frequency cepstral coefficient; Speaker recognition; Speech; Testing; Training; Vectors; Fisher criterion; Mel-Frequency Cepstrum Coefficients (MFCC); feature extraction; speaker recognition; vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control Conference (CCC), 2013 32nd Chinese
Conference_Location :
Xi´an
Type :
conf
Filename :
6640103
Link To Document :
بازگشت