• DocumentCode
    780156
  • Title

    A Soft Voice Activity Detection Using GARCH Filter and Variance Gamma Distribution

  • Author

    Tahmasbi, Rasool ; Rezaei, Sadegh

  • Author_Institution
    Amir Kabir Univ., Tehran
  • Volume
    15
  • Issue
    4
  • fYear
    2007
  • fDate
    5/1/2007 12:00:00 AM
  • Firstpage
    1129
  • Lastpage
    1134
  • Abstract
    This paper presents a robust algorithm for a voice activity detector (VAD) based on generalized autoregressive conditional heteroscedasticity (GARCH) filter, variance gamma distribution (VGD), and adaptive threshold function. GARCH models are new statistical methods that are used especially in economic time series. There is a consensus that speech signals exhibit variances that change through time. GARCH models are a popular choice to model these changing variances. A speech signal is assumed to have a VGD because the VGD has heavier tails than the Gaussian distribution (GD). The distribution of noise signal is assumed to be Gaussian. In proposed method, heteroscedasticity will be modeled by GARCH, and then the parameters of the distributions will be estimated recursively. Finally, hard detection is the result of comparing a multiple observation likelihood ratio test (MOLRT) with an adaptive threshold function. The simulation results show that the proposed VAD is able to operate down to -5 dB and in nonstationary environments
  • Keywords
    filtering theory; gamma distribution; speech processing; speech recognition; statistical analysis; GARCH filter; adaptive threshold function; generalized autoregressive conditional heteroscedasticity; multiple observation likelihood ratio test; soft voice activity detection; speech signals; statistical methods; variance gamma distribution; Adaptive filters; Change detection algorithms; Environmental economics; Gamma ray detection; Gamma ray detectors; Gaussian distribution; Probability distribution; Robustness; Speech; Statistical analysis; Estimation theory; generalized autoregressive conditional heteroscedasticity (GARCH) model; heteroscedasticity; probability distribution; voice activity detection (VAD);
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2007.894521
  • Filename
    4156217