DocumentCode :
2511909
Title :
Noise-Robust Voice Activity Detector Based on Hidden Semi-Markov Models
Author :
Liu, Xianglong ; Liang, Yuan ; Lou, Yihua ; Li, He ; Shan, Baosong
Author_Institution :
State Key Lab. of Software Dev. Environ., Beihang Univ., Beijing, China
fYear :
2010
fDate :
23-26 Aug. 2010
Firstpage :
81
Lastpage :
84
Abstract :
This paper concentrates on speech duration distributions that are usually invariant to noises and proposes a noise-robust and real-time voice activity detector (VAD) using the hidden semi-Markov model (HSMM) to explicitly model state durations. Motivated by statistical observations and tests on TIMIT and the IEEE sentence database, we use Weibull distributions to model state durations approximately and estimate their parameters by maximum likelihood estimators. The final VAD decision is made according to the likelihood ratio test (LRT) incorporating state prior knowledge and modified forward variables. An efficient way that recursively calculates modified forward variables is devised and a dynamic adjustment scheme is used to update parameters. Experiments on noisy speech data show that the proposed method performs more robustly and accurately than the standard ITU-T G.729B VAD and AMR2.
Keywords :
Weibull distribution; hidden Markov models; maximum likelihood estimation; speech processing; AMR2; HSMM; IEEE sentence database; Weibull distributions; hidden semiMarkov models; likelihood ratio test; model state durations; noise-robust voice activity detector; standard ITU-T G.729B VAD; statistical observations; Databases; Hidden Markov models; Noise; Noise measurement; Robustness; Speech; Weibull distribution;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
ISSN :
1051-4651
Print_ISBN :
978-1-4244-7542-1
Type :
conf
DOI :
10.1109/ICPR.2010.28
Filename :
5597633
Link To Document :
بازگشت