• DocumentCode
    454624
  • Title
    An Analysis of Visual Speech Information Applied to Voice Activity Detection
  • Author
    Sodoyer, David ; Rivet, Bertrand ; Girin, Laurent ; Schwartz, Jean-Luc ; Jutten, Christian
  • Author_Institution
    Inst. of Speech Commun., CNRS, Grenoble
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    We present a new approach to the voice activity detection (VAD) problem for speech signals embedded in non-stationary noise. The method is based on automatic lipreading: the objective is to detect voice activity or non-activity by exploiting the coherence between the speech acoustic signal and the speaker's lip movements. From a comprehensive analysis of lip shape parameters during speech and non-speech events, we show that a single appropriate visual parameter, defined to characterize the lip movements, can be used for the detection of sections of voice activity or, more precisely, for the detection of silence sections. Detection scores obtained on spontaneous speech confirm the efficiency of the visual voice activity detector (VVAD).
  • Keywords
    face recognition; gesture recognition; speech recognition; automatic lipreading; lip movements; nonspeech events; nonstationary noise; speech acoustic signal; visual speech information; visual voice activity detector; voice activity detection; Acoustic noise; Acoustic signal detection; Background noise; Detectors; Event detection; Information analysis; Signal processing; Speech analysis; Speech enhancement; Speech processing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type
    conf
  • DOI
    10.1109/ICASSP.2006.1660092
  • Filename
    1660092