• DocumentCode
    417218
  • Title

    A differential spectral voice activity detector

  • Author

    Garner, Philip N. ; Fukada, Toshiaki ; Komori, Yasuhiro

  • Author_Institution
    Canon Inc, Tokyo, Japan
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    The voice activity detection (VAD) problem is placed into a decision theoretic framework, and the Gaussian VAD model of Sohn et al. (1998, 1999) is then shown to fit well with the framework. It is argued that the Gaussian model can be made more robust to correlation and expected spectral shapes of speech and noise by using a differential spectral representation. Such a model is formulated theoretically. The differential spectral VAD is then shown by experiment to compare favourably with the basic Gaussian VAD in a speech recognition setting, especially for noisy environments.
  • Keywords
    Gaussian distribution; decision theory; signal representation; spectral analysis; speech recognition; Gaussian VAD model; correlation robustness; differential spectral representation; spectral shapes; speech recognition; voice activity detector; Automatic speech recognition; Cost function; Detectors; Gaussian noise; Noise robustness; Noise shaping; Spectral shape; Speech enhancement; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326056
  • Filename
    1326056