DocumentCode
3542167
Title
Improved voice activity detection via contextual information and noise suppression
Author
Sangwan, Abhijeet ; Zhu, W.P. ; Ahmad, M.O.
Author_Institution
Dept. of Electr. & Electron. Eng., Concordia Univ., Montreal, Que., Canada
fYear
2005
fDate
23-26 May 2005
Firstpage
868
Abstract
In this paper, we develop a contextual voice activity detection (VAD) scheme which combines both contextual and frame specific information to improve detection. Unlike many VAD algorithms which assume that the cues to activity lie within the frame alone, our scheme seeks information for activity in the current as well as the neighboring frames. The new approach provides good robustness in low SNR when the speech frame is corrupted and an alternate reliable source of activity information is necessary. Further, we present a simple noise suppression scheme to enhance the VAD performance at low SNR. The noise suppressor provides spectrally reshaped signal to the VAD. Finally, we combine the contextual VAD and the noise suppression scheme with a basic detector to form a comprehensive VAD. The proposed comprehensive VAD system is tested on speech samples from the SWITCHBOARD database. Various noises under different SNRs are added to the speech signals. Experimental results show that the proposed VAD outperforms the standard algorithm ETSI AMR VAD-1.
Keywords
signal denoising; speech processing; contextual information based VAD; frame specific information; low SNR region robustness; noise suppression; signal spectral reshaping; voice activity detection; Acoustic noise; Cascading style sheets; Context; Detectors; Frequency; Hidden Markov models; Low-frequency noise; Signal processing algorithms; Signal to noise ratio; Speech enhancement;
fLanguage
English
Publisher
ieee
Conference_Titel
Circuits and Systems, 2005. ISCAS 2005. IEEE International Symposium on
Print_ISBN
0-7803-8834-8
Type
conf
DOI
10.1109/ISCAS.2005.1464726
Filename
1464726
Link To Document