Title :
A new multi-parameter dual-threshold state discrimination algorithm for voice activity detection
Author :
Xinyan Zeng ; Guojun Zhao
Author_Institution :
Minist. of Educ. Key Lab. of Mech. Manuf. & Autom., Zhejiang Univ. of Technol., Hangzhou, China
Abstract :
The generally accepted voice activity detection (VAD) algorithm does not achieve the desired results. Against this background, a comprehensive algorithm based on multiple parameters and dual-threshold state discrimination is proposed. Drawing on the finite-state method, the algorithm sets a high and a low threshold on fundamental characteristic parameters such as the short-time average energy (STAE), the short-time zero-domain-crossing rate (ZDCR), and the time duration. The interval between the two thresholds helps improve voice activity detection. The new algorithm was compared and analyzed in Matlab and then tested experimentally on a DSP-based processing system. The results show that it is more effective and accurate than the widely accepted algorithm.
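The abstract does not give the state transitions or threshold values, so the following is only a minimal sketch of a generic dual-threshold, finite-state endpoint detector of the kind described, not the authors' implementation. The function names, the three state names, the way the zero-crossing rate is combined with energy, and the parameters `min_speech` and `max_silence` are all assumptions made for illustration.

```python
from enum import Enum


class State(Enum):
    SILENCE = 0  # no voice activity
    MAYBE = 1    # above the low threshold only (assumed transitional state)
    SPEECH = 2   # high threshold has been crossed


def short_time_energy(frame):
    """Mean squared amplitude of one frame (STAE)."""
    return sum(x * x for x in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def detect_speech(energies, zcrs, e_low, e_high, z_thresh,
                  min_speech=3, max_silence=2):
    """Return (start, end) frame-index pairs judged to contain speech.

    A frame is 'active' if its energy exceeds e_low or its zero-crossing
    rate exceeds z_thresh. A segment is confirmed only once the energy
    exceeds e_high, and is kept only if it spans at least min_speech
    frames (the duration check). All rules here are assumed, not taken
    from the paper.
    """
    segments = []
    state = State.SILENCE
    start = last_active = 0
    for i, (e, z) in enumerate(zip(energies, zcrs)):
        active = e > e_low or z > z_thresh
        if state is State.SILENCE:
            if e > e_high:
                state, start, last_active = State.SPEECH, i, i
            elif active:
                state, start, last_active = State.MAYBE, i, i
        elif state is State.MAYBE:
            if e > e_high:
                state, last_active = State.SPEECH, i
            elif active:
                last_active = i
            else:
                state = State.SILENCE  # never confirmed, treat as noise
        else:  # State.SPEECH
            if active:
                last_active = i
            elif i - last_active > max_silence:
                if last_active - start + 1 >= min_speech:
                    segments.append((start, last_active))
                state = State.SILENCE
    # Close a segment that is still open at the end of the signal.
    if state is State.SPEECH and last_active - start + 1 >= min_speech:
        segments.append((start, last_active))
    return segments
```

In this sketch the low threshold marks a tentative onset and the high threshold confirms it, so a short burst that crosses the high threshold but lasts fewer than `min_speech` frames is discarded as interference. The per-frame inputs would come from applying `short_time_energy` and `zero_crossing_rate` to fixed-length frames of the signal.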
Keywords :
speech processing; DSP based processing system; STAE; VAD; ZDCR; comprehensive algorithm; finite state method; fundamental characteristic parameters; new multiparameter dual threshold state discrimination algorithm; short-time average energy; time duration; voice activity detection; zero domain crossing rate; Algorithm design and analysis; Interference; Signal processing algorithms; Signal to noise ratio; Speech; Speech recognition; voice activity detection (VAD); voice signal; zero-crossing rate (ZCR)
Conference_Title :
Natural Computation (ICNC), 2013 Ninth International Conference on
Conference_Location :
Shenyang
DOI :
10.1109/ICNC.2013.6818168