Title :
Hybrid SVM/HMM architectures for statistical model-based voice activity detection
Author :
Ying-Wei Tan ; Wen-Ju Liu ; Wei Jiang ; Hao Zheng
Author_Institution :
Dept. of Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
Abstract :
The decision function of support vector machine (SVM) using the likelihood ratios (LRs) is successfully used for statistical model-based voice activity detection (VAD). It is known to incorporate an optimised nonlinear decision over two different classes, instead of comparing the geometric mean of the LRs for the individual frequency bands with a given threshold for speech detection. However, the inter-frame correlation of the voice activity is not taken into consideration. In this paper, we explore a hybrid SVM/hidden Markov model (HMM) approach for the VAD, which retains discriminative and nonlinear properties of SVM, while modeling the interframe correlation powerfully through a first-order HMM. Experimental results show the significant improvement of the performance of the proposed VAD in comparison with the SVM-based VAD.
Keywords :
hidden Markov models; maximum likelihood estimation; speech processing; support vector machines; HMM; LRs; SVM; VAD; hidden Markov model; likelihood ratios; speech detection; statistical model-based voice activity detection; support vector machine; Correlation; Hidden Markov models; Signal to noise ratio; Speech; Speech enhancement; Support vector machines;
Conference_Titel :
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-6627-1
DOI :
10.1109/IJCNN.2014.6889403