مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

3292462

Title :

Speaker Recognition with VAD

Author :

Ling, Jian ; Sun, Shuifa ; Zhu, Jianwei ; Liu, Xiaoli

Author_Institution :

Sch. of Inf. Eng., Zhejiang Univ. of Media & Commun., Hangzhou, China

fYear :

2009

fDate :

6-7 June 2009

Firstpage :

313

Lastpage :

315

Abstract :

This work is mainly focused on showing experimental results of speaker recognition with voice activity detection. A VAD algorithm based on the finite state machine is introduced firstly. The algorithm is incorporated into two speaker recognition (SR)systems. The mel frequency ceptral coefficients(MFCCs) are adopted as the speaker speech feature parameters in both systems. Vector quantization (VQ)and Gaussian mixture model (GMM) are the classifiers of the two SR systems, respectively. The experimental results show that the VAD improved the performance of both SR systems with small speech database. However, as the speech databases get bigger and bigger, the performance of both SR systems with VAD gets worse and worse, compared to those of systems without VAD. The reason of the phenomenon is analyzed in detail.

Keywords :

Gaussian processes; feature extraction; finite state machines; speaker recognition; vector quantisation; GMM; Gaussian mixture model; MFCC; SR; VAD; VQ; finite state machine; mel frequency ceptral coefficient; speaker recognition; speech feature parameter; vector quantization; voice activity detection; Automata; Automatic speech recognition; Filter bank; Frequency; Microphones; Spatial databases; Speaker recognition; Speech processing; Strontium; Vectors; FSM; VAD; speaker recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Web Mining and Web-based Application, 2009. WMWA '09. Second Pacific-Asia Conference on

Conference_Location :

Wuhan

Print_ISBN :

978-0-7695-3646-0

Type :

conf

DOI :

10.1109/WMWA.2009.59

Filename :

5232527

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3292462