DocumentCode :
3292462
Title :
Speaker Recognition with VAD
Author :
Ling, Jian ; Sun, Shuifa ; Zhu, Jianwei ; Liu, Xiaoli
Author_Institution :
Sch. of Inf. Eng., Zhejiang Univ. of Media & Commun., Hangzhou, China
fYear :
2009
fDate :
6-7 June 2009
Firstpage :
313
Lastpage :
315
Abstract :
This work is mainly focused on showing experimental results of speaker recognition with voice activity detection. A VAD algorithm based on the finite state machine is introduced firstly. The algorithm is incorporated into two speaker recognition (SR)systems. The mel frequency ceptral coefficients(MFCCs) are adopted as the speaker speech feature parameters in both systems. Vector quantization (VQ)and Gaussian mixture model (GMM) are the classifiers of the two SR systems, respectively. The experimental results show that the VAD improved the performance of both SR systems with small speech database. However, as the speech databases get bigger and bigger, the performance of both SR systems with VAD gets worse and worse, compared to those of systems without VAD. The reason of the phenomenon is analyzed in detail.
Keywords :
Gaussian processes; feature extraction; finite state machines; speaker recognition; vector quantisation; GMM; Gaussian mixture model; MFCC; SR; VAD; VQ; finite state machine; mel frequency ceptral coefficient; speaker recognition; speech feature parameter; vector quantization; voice activity detection; Automata; Automatic speech recognition; Filter bank; Frequency; Microphones; Spatial databases; Speaker recognition; Speech processing; Strontium; Vectors; FSM; VAD; speaker recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Mining and Web-based Application, 2009. WMWA '09. Second Pacific-Asia Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-0-7695-3646-0
Type :
conf
DOI :
10.1109/WMWA.2009.59
Filename :
5232527
Link To Document :
بازگشت