• DocumentCode
    383811
  • Title

    A wavelet-based voice activity detection algorithm in noisy environments

  • Author

    Chen, Shi-Huang ; Wang, Jhing-Fa

  • Author_Institution
    Dept. of Electr. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
  • Volume
    3
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    995
  • Abstract
    This paper presents a new voice activity detection (VAD) algorithm based on the perceptual wavelet packet transform (PWPT) and the Teager energy operator (TEO). The basic procedure of the proposed VAD algorithm is to make use of the PWPT to decompose the input speech into critical subband signals. Then a parameter called voice activity shape (VAS) can be derived from the TEO of these critical subband signals. It is shown in this paper that the VAS can be used as a robust feature for VAD. The advantage of this new algorithm is that the preset threshold values or a priori knowledge of the SNR usually needed in conventional VAD methods can be completely avoided. Various experimental results show that the proposed VAD algorithm is capable of outperforming to the ITU-T G.729B VAD and can operate reliably in real noisy environments.
  • Keywords
    acoustic noise; mathematical operators; speech intelligibility; speech processing; speech recognition; telecommunication standards; wavelet transforms; ITU-T G.729B VAD; PWPT; SNR a priori knowledge; SNR threshold values; TEO; Teager energy operator; VAD algorithm; critical subband signals; input speech decomposition; noisy environments; perceptual wavelet packet transform; voice activity detection algorithm; voice activity shape parameter; wavelet-based voice activity detection algorithm; Detection algorithms; Robustness; Shape; Speech coding; Speech enhancement; Speech processing; Time frequency analysis; Wavelet packets; Wavelet transforms; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electronics, Circuits and Systems, 2002. 9th International Conference on
  • Print_ISBN
    0-7803-7596-3
  • Type

    conf

  • DOI
    10.1109/ICECS.2002.1046417
  • Filename
    1046417