DocumentCode :
3272420
Title :
Robust voice activity detection algorithm based on the perceptual wavelet packet transform
Author :
Chen, Shi-Huang ; Wu, Hsin-Te ; Chen, Chia-Hsiang ; Ruan, Jiun-Ching ; Truong, T.K.
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Shu-Te Univ., Kaohsiung, Taiwan
fYear :
2005
fDate :
13-16 Dec. 2005
Firstpage :
45
Lastpage :
48
Abstract :
In this paper, a robust voice activity detection (VAD) algorithm based on the perceptual wavelet packet transform (PWPT) is proposed. The first step of this new VAD algorithm is to make use of the PWPT to decompose the input speech into 17 critical subband signals. To enhance energy of voice frames and decay energy of unvoice frames, the voice activity shape (VAS) is derived from the Teager energy operator (TEO) of these critical subband signals. Then the adaptive weighted threshold (AWT) value can be calculated from the second derivative recursive mean (SDRM) of the VAS and environments noise estimation. It is shown in this paper that the AWT is a robust threshold value for VAD under various noisy environments. One of advantages of this new algorithm is that the preset threshold values are not necessary. In addition, the proposed algorithm can adapt VAD threshold value to variable speech conditions. Experimental results show that the new VAD algorithm outperforms the G.729B and adaptive multi rate (AMR) VAD.
Keywords :
adaptive signal processing; speech processing; voice communication; wavelet transforms; Teager energy operator; adaptive multirate method; adaptive weighted threshold; noise estimation; perceptual wavelet packet transform; second derivative recursive mean; voice activity detection algorithm; voice activity shape; Bandwidth; Detection algorithms; Energy resolution; Robustness; Signal resolution; Speech processing; Time frequency analysis; Wavelet packets; Wavelet transforms; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Signal Processing and Communication Systems, 2005. ISPACS 2005. Proceedings of 2005 International Symposium on
Print_ISBN :
0-7803-9266-3
Type :
conf
DOI :
10.1109/ISPACS.2005.1595342
Filename :
1595342
Link To Document :
بازگشت