DocumentCode
2139642
Title
A new multi-parameter dual-threshold state discrimination algorithm for voice activity detection
Author
Xinyan Zeng ; Guojun Zhao
Author_Institution
Minist. of Educ. Key Lab. of Mech. Manuf. &Autom., Zhejiang Univ. of Technol., Hangzhou, China
fYear
2013
fDate
23-25 July 2013
Firstpage
1239
Lastpage
1243
Abstract
The desired results cannot be achieved by applying generally accepted algorithm of voice activity detection (VAD). Against such background, a comprehensive algorithm featuring multiple parameters and dual-threshold state discrimination algorithm is proposed. This algorithm sets both a high and a low threshold based on the fundamental characteristic parameters such as the short-time average energy (STAE), the short-time zero-domain-crossing rate (ZDCR) and the time duration, with reference to the design of finite state method. The interval between these two thresholds helps to detect voice activity with the improved results. After the new algorithm was compared and analyzed through Matlab, it experimentally tested on a DSP-based processing system. It proves that it is more effective and accurate than the widely accepted one.
Keywords
speech processing; DSP based processing system; STAE; VAD; ZDCR; comprehensive algorithm; finite state method; fundamental characteristic parameters; new multiparameter dual threshold state discrimination algorithm; short-time average energy; time duration; voice activity detection; zero domain crossing rate; Algorithm design and analysis; Interference; Signal processing algorithms; Signal to noise ratio; Speech; Speech recognition; voice activity detection (VAD); voice signal; zero-crossing rate (ZCR);
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Computation (ICNC), 2013 Ninth International Conference on
Conference_Location
Shenyang
Type
conf
DOI
10.1109/ICNC.2013.6818168
Filename
6818168
Link To Document