• DocumentCode
    3051680
  • Title

    On using spectral gradient in conditional MAP criterion for robust voice activity detection

  • Author

    Jae-Hun Choi ; Joon-Hyuk Chang

  • Author_Institution
    Sch. of Electr. Eng., Hanyang Univ., Seoul, South Korea
  • fYear
    2012
  • fDate
    21-23 Sept. 2012
  • Firstpage
    370
  • Lastpage
    374
  • Abstract
    In this paper, we propose a novel approach to improve a statistical model-based voice activity detection (VAD) method based on a modified conditional maximum a posteriori (MAP) criterion incorporating the spectral gradient scheme. The proposed conditional MAP incorporates not only the voice activity decision in the previous frame as in Ref. [1] but also the spectral gradient of the observed spectra between the current frame and the past frames to efficiently exploit the inter-frame correlation of voice activity. As a result, the proposed VAD leads to six separate thresholds to be adaptively determined in the likelihood ratio test (LRT) depending on both the previous VAD result and the estimated spectral gradient parameter. Experimental results demonstrate that the proposed approach yields better results compared to those of the previous conditional MAP-based method.
  • Keywords
    maximum likelihood estimation; speech recognition; MAP criterion; VAD method; conditional MAP-based method; conditional map criterion; estimated spectral gradient parameter; inter-frame correlation; likelihood ratio test; modified conditional maximum a posteriori; robust voice activity detection; spectral gradient; spectral gradient scheme; statistical model-based voice activity detection; Correlation; Laplace equations; Noise measurement; Signal to noise ratio; Speech; Speech processing; Conditional MAP; Likelihood ratio test; Spectral gradient; Voice activity detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Network Infrastructure and Digital Content (IC-NIDC), 2012 3rd IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4673-2201-0
  • Type

    conf

  • DOI
    10.1109/ICNIDC.2012.6418777
  • Filename
    6418777