DocumentCode :
1797326
Title :
Hybrid SVM/HMM architectures for statistical model-based voice activity detection
Author :
Ying-Wei Tan ; Wen-Ju Liu ; Wei Jiang ; Hao Zheng
Author_Institution :
Dept. of Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
fYear :
2014
fDate :
6-11 July 2014
Firstpage :
2875
Lastpage :
2878
Abstract :
The decision function of a support vector machine (SVM) applied to likelihood ratios (LRs) has been used successfully for statistical model-based voice activity detection (VAD). It provides an optimised nonlinear decision between two classes, rather than comparing the geometric mean of the LRs of the individual frequency bands with a given threshold for speech detection. However, it does not take the inter-frame correlation of voice activity into account. In this paper, we explore a hybrid SVM/hidden Markov model (HMM) approach to VAD that retains the discriminative and nonlinear properties of the SVM while modeling the inter-frame correlation through a first-order HMM. Experimental results show that the proposed VAD significantly outperforms the SVM-based VAD.
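The idea in the abstract, frame-wise SVM scores smoothed by a first-order HMM, can be illustrated with a minimal sketch. This is not the authors' formulation, which the abstract does not specify. It assumes that raw SVM decision values are mapped to pseudo-posteriors with a sigmoid (the `a` and `b` parameters are hypothetical), that these pseudo-posteriors stand in for HMM emission scores, and that a symmetric two-state transition matrix with a hypothetical `p_stay` models inter-frame correlation. The names `sigmoid_posterior` and `viterbi_vad` are illustrative only.

```python
import numpy as np


def sigmoid_posterior(svm_scores, a=1.0, b=0.0):
    """Map raw SVM decision values to pseudo-posteriors P(speech | frame).

    The sigmoid and its parameters a, b are assumptions for this sketch.
    """
    scores = np.asarray(svm_scores, dtype=float)
    return 1.0 / (1.0 + np.exp(-(a * scores + b)))


def viterbi_vad(svm_scores, p_stay=0.9, prior_speech=0.5):
    """Decode a speech (1) / non-speech (0) label per frame with Viterbi.

    p_stay is the assumed probability of remaining in the same state
    from one frame to the next (first-order Markov dependency).
    """
    post = sigmoid_posterior(svm_scores)
    n = len(post)
    if n == 0:
        return np.zeros(0, dtype=int)

    eps = 1e-12  # avoids log(0)
    # Emission log-scores: column 0 = non-speech, column 1 = speech.
    log_emit = np.log(np.stack([1.0 - post, post], axis=1) + eps)
    # log_trans[i, j] = log P(state j at frame t | state i at frame t-1).
    log_trans = np.log(np.array([[p_stay, 1.0 - p_stay],
                                 [1.0 - p_stay, p_stay]]))
    log_prior = np.log(np.array([1.0 - prior_speech, prior_speech]))

    delta = np.empty((n, 2))           # best log-score ending in each state
    back = np.zeros((n, 2), dtype=int)  # best predecessor state
    delta[0] = log_prior + log_emit[0]
    for t in range(1, n):
        cand = delta[t - 1][:, None] + log_trans  # cand[i, j]: from i to j
        back[t] = np.argmax(cand, axis=0)
        delta[t] = cand[back[t], [0, 1]] + log_emit[t]

    # Backtrack the most likely state sequence.
    path = np.empty(n, dtype=int)
    path[-1] = int(np.argmax(delta[-1]))
    for t in range(n - 2, -1, -1):
        path[t] = back[t + 1, path[t + 1]]
    return path


if __name__ == "__main__":
    # Hypothetical SVM outputs: frames 2 and 7 barely cross zero.
    scores = [2.0, 2.0, -0.2, 2.0, 2.0, -2.0, -2.0, 0.2, -2.0, -2.0]
    print("frame-wise:", (np.array(scores) > 0).astype(int))
    print("SVM + HMM :", viterbi_vad(scores))
```

Under these assumptions, thresholding each frame at zero flips the label on isolated weak frames, whereas the decoded path only changes state when the accumulated evidence outweighs the transition penalty.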
Keywords :
hidden Markov models; maximum likelihood estimation; speech processing; support vector machines; HMM; LRs; SVM; VAD; hidden Markov model; likelihood ratios; speech detection; statistical model-based voice activity detection; support vector machine; Correlation; Hidden Markov models; Signal to noise ratio; Speech; Speech enhancement; Support vector machines;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-6627-1
Type :
conf
DOI :
10.1109/IJCNN.2014.6889403
Filename :
6889403