Title :
Robust voice-activity detection based on the wavelet transform
Author :
Stegmann, Joachim ; Schröder, Gerhard
Author_Institution :
Deutsche Telekom Berkom, Darmstadt, Germany
Abstract :
This paper describes a new approach to voice-activity detection (VAD) which is based on the wavelet transform (WT). The algorithm utilizes the WTs flexibility in the time-frequency resolution to compute robust parameters for VAD decision. Furthermore, it exhibits a low complexity and can be easily adapted to operate as a pre-processor for many speech-coding algorithms. Two versions of the wavelet-transform-based VAD (WT-VAD) are tested against the VAD of the ITU-T G.729 Annex B (G729) and the VAD of the GSM enhanced full-rate codec (GSM), respectively. For a variety of background-noise types the WT-VAD shows superior noise robustness to signal-to-noise ratios above 10 dB
Keywords :
signal detection; signal resolution; speech codecs; speech coding; speech processing; time-frequency analysis; transform coding; voice communication; wavelet transforms; GSM enhanced full-rate codec; ITU-T G.729 Annex B; SNR; algorithm; background noise types; pre-processor; robust parameters; robust voice-activity detection; signal-to-noise ratios; speech coding algorithms; time-frequency resolution; wavelet transform; Background noise; Bit rate; Discrete wavelet transforms; Filter bank; GSM; Robustness; Signal to noise ratio; Speech; Time frequency analysis; Wavelet transforms;
Conference_Titel :
Speech Coding For Telecommunications Proceeding, 1997, 1997 IEEE Workshop on
Conference_Location :
Pocono Manor, PA
Print_ISBN :
0-7803-4073-6
DOI :
10.1109/SCFT.1997.623915