DocumentCode :
3660373
Title :
An improved robust statistical voice activity detection based on sub-band periodic intensity
Author :
Weijun He;Xiaohui Feng;Zhengyu Zhu;Weili Zhou
Author_Institution :
School of Electronic and Information Engineering, South China University of Technology, Guangzhou, Guangdong, China
fYear :
2015
Firstpage :
2171
Lastpage :
2175
Abstract :
An investigation of statistical model-based voice activity detection (VAD) using the likelihood ratio test revealed a false alarm problem when detecting non-verbal vocalization signals. In this paper, an improved statistical model-based VAD method is proposed for adverse noise environments, which employs a reserved coefficient in the decision rule. The reserved coefficient is determined by sub-band periodic intensity, with the sub-bands divided on the basis of human auditory perception characteristics. The final decision depends on the geometric mean of the reserved sub-band likelihood ratios. Simulations carried out on the CADCC and NOISEX-92 databases show promising performance in comparison with traditional robust VAD methods under both stationary and nonstationary noise conditions, in terms of improved false alarm rate and receiver operating characteristic (ROC) curve.
Keywords :
"Speech","Speech processing","Correlation","Robustness","Signal to noise ratio","Frequency conversion"
Publisher :
IEEE
Conference_Title :
Information and Automation, 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/ICInfA.2015.7279647
Filename :
7279647