• DocumentCode
    82017
  • Title
    Robust Whisper Activity Detection Using Long-Term Log Energy Variation of Sub-Band Signal
  • Author
    Meenakshi, G. Nisha ; Ghosh, Prasanta Kumar
  • Author_Institution
    Indian Inst. of Sci., Bangalore, India
  • Volume
    22
  • Issue
    11
  • fYear
    2015
  • fDate
    Nov. 2015
  • Firstpage
    1859
  • Lastpage
    1863
  • Abstract
    The goal of whisper activity detection (WAD) is to find the whispered speech segments in a given noisy recording of whispered speech. Since whispering lacks periodic glottal excitation, it resembles unvoiced speech. This noise-like nature of whispered speech makes WAD a more challenging task than a typical voice activity detection (VAD) problem. In this paper, we propose a feature for WAD based on the long-term variation of the logarithm of the short-time sub-band signal energy. We also propose an automatic sub-band selection algorithm to maximally discriminate noisy whisper from noise. Experiments with eight noise types in four different signal-to-noise ratio (SNR) conditions show that, for most of the noise types, the performance of the proposed WAD scheme is significantly better than that of existing VAD schemes and whisper detection schemes when used for WAD.
  • Keywords
    signal detection; speech processing; WAD scheme; automatic subband selection algorithm; long-term log energy variation; periodic glottal excitation; robust whisper activity detection; subband signal; voice activity detection problem; whispered speech segments; Histograms; Noise measurement; Radio frequency; Signal processing algorithms; Signal to noise ratio; Speech; Long-term signal measure; sub-band selection; whisper activity detection; whispered speech
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    IEEE
  • ISSN
    1070-9908
  • Type
    jour
  • DOI
    10.1109/LSP.2015.2439514
  • Filename
    7115062
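
The feature named in the Abstract (long-term variation of the log of the short-time sub-band energy) can be illustrated with a rough sketch. This record does not give the paper's exact definition, so everything specific here is an assumption made for illustration: the variation measure (a plain variance over a sliding window of frames), the band edges, the frame and hop sizes, the context length, and the hypothetical helper names `subband_log_energy` and `long_term_variation`. The automatic sub-band selection step is not sketched, because the record gives no detail on it.

```python
import math
import random


def subband_log_energy(signal, frame_len, hop, fs, band):
    """Log short-time energy inside band=(lo_hz, hi_hz), one value per frame.

    Assumption: the sub-band is taken as a set of DFT bins (naive DFT,
    slow but dependency-free); the paper may use a different filter.
    """
    lo, hi = band
    bins = [k for k in range(frame_len // 2 + 1)
            if lo <= k * fs / frame_len <= hi]
    out = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        energy = 0.0
        for k in bins:
            w = 2.0 * math.pi * k / frame_len
            re = sum(x * math.cos(w * n) for n, x in enumerate(frame))
            im = sum(x * math.sin(w * n) for n, x in enumerate(frame))
            energy += re * re + im * im
        out.append(math.log(energy + 1e-12))  # small floor avoids log(0)
    return out


def long_term_variation(log_e, context):
    """Variance of log energy over the current and previous frames.

    Assumption: "long-term variation" is modeled as a population variance
    over at most `context` frames ending at the current one.
    """
    feats = []
    for m in range(len(log_e)):
        window = log_e[max(0, m - context + 1):m + 1]
        mean = sum(window) / len(window)
        feats.append(sum((v - mean) ** 2 for v in window) / len(window))
    return feats


# Synthetic stand-in data (not from the paper): stationary noise followed
# by noise whose amplitude switches between two levels every 400 samples,
# a crude imitation of noise-like speech bursts.
random.seed(0)
fs = 8000
noise = [random.gauss(0.0, 1.0) for _ in range(4000)]
active = [(5.0 if (n // 400) % 2 == 0 else 1.0) * random.gauss(0.0, 1.0)
          for n in range(4000)]
signal = noise + active

log_e = subband_log_energy(signal, frame_len=64, hop=32, fs=fs,
                           band=(1000.0, 3000.0))
feats = long_term_variation(log_e, context=20)

# Compare frames whose whole context lies in one segment.
noise_mean = sum(feats[20:110]) / len(feats[20:110])
active_mean = sum(feats[150:]) / len(feats[150:])
print(noise_mean, active_mean)
```

In this toy setup the feature stays small over the stationary noise and grows over the amplitude-modulated segment, which is the kind of contrast a detector could threshold. Any such threshold would have to be tuned on real data, and the results reported in the Abstract refer to the authors' own method, not to this sketch.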