Title :
Robust voice activity detection for DTX operation of speech coders
Author :
Basbug, Filiz ; Nandkumar, S. ; Swaminathan, Karthik
Author_Institution :
Hughes Network Syst. Inc., Germantown, MD, USA
Abstract :
Robust detection of voice activity in short-term speech frames is essential for the discontinuous transmission (DTX) mode of operation of vocoders such as IS-641. A reference VAD for the IS-641 coder has been chosen for this purpose; it is based on the GSM-EFR (enhanced full rate) VAD. By developing a comprehensive evaluation procedure, we show that the reference VAD is sensitive to speech level variations. For example, the number of frames falsely classified as active increases significantly at speech levels 10 dB above or below the nominal level. We propose a solution based on automatic gain control to reduce level sensitivity. Objective performance measures confirm the robustness of our proposed VAD.
Keywords :
acoustic signal detection; automatic gain control; speech coding; vocoders; DTX operation; GSM-EFR VAD; IS-641 coder; discontinuous transmission mode; enhanced full rate; objective performance measures; robust detection; sensitivity reduction; short-term speech frames; speech coders; speech level variations; voice activity detection; Base stations; Battery charge measurement; GSM; Gain control; Robust control; Robustness; Speech analysis; Speech enhancement; Statistics
Conference_Title :
Speech Coding Proceedings, 1999 IEEE Workshop on
Conference_Location :
Porvoo
Print_ISBN :
0-7803-5651-9
DOI :
10.1109/SCFT.1999.781483