Title :
A robust voice activity detector based on Weibull and Gaussian Mixture distribution
Author :
Liang, Yuan ; Liu, Xianglong ; Zhou, Mi ; Lou, Yihua ; Shan, Baosong
Author_Institution :
State Key Lab. of Software Dev. Environ., Beihang Univ., Beijing, China
Abstract :
In this paper, we focus on the observation and state duration distributions in hidden semi-Markov model (HSMM)-based voice activity detection. To perform robustly in noisy environment, firstly, acoustic features of noisy speech are extracted by Mel-frequency cepstrum processor after filtering the raw speech with a modified Wiener filter. According to the statistic on TIMIT database, we use Gaussian Mixture distributions (GMD) for both speech and non-speech state to correlate the MFCC feature vectors and state sequences. The transition probability in HSMM is not a constant like in HMM but depends on the elapsed time in last state, and is modeled by Weibull distribution (WD) in this paper. The final VAD decision is made according to the likelihood ratio test (LRT) incorporating state prior knowledge. Also a adaptive threshold is used to achieve better detection results. Experiments on noisy speech data show that the proposed method performs more robustly and accurately than the standard ITU-T G.729B, AMR2, HMM-based VAD and VAD using Laplacian-Gaussian model.
Keywords :
Gaussian distribution; Weibull distribution; Wiener filters; hidden Markov models; speech recognition; Gaussian mixture distribution; Mel-frequency cepstrum processor; Weibull distribution; Wiener filter; hidden semiMarkov model; likelihood ratio test; robust voice activity detection; state sequence; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Noise; Noise measurement; Speech; Wiener filter; Gaussian Mixture Distribution; Voice Activity Detection; Weibull Distribution;
Conference_Titel :
Signal Processing Systems (ICSPS), 2010 2nd International Conference on
Conference_Location :
Dalian
Print_ISBN :
978-1-4244-6892-8
Electronic_ISBN :
978-1-4244-6893-5
DOI :
10.1109/ICSPS.2010.5555230