• DocumentCode
    1835413
  • Title
    Consideration of Lombard effect for speechreading
  • Author
    Huang, Fu Jie ; Chen, Tsuhan
  • Author_Institution
    Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    613
  • Lastpage
    618
  • Abstract
    We propose a method for integrating audio and visual information to enhance speech recognition in adverse environments. We train the audio hidden Markov model and the visual hidden Markov model separately, and then use a Viterbi algorithm to decode both channels in parallel. The decoding process is asynchronous between the two channels to capture the asynchronous nature of audio and visual speech. We test the proposed method using speech corrupted by various types of noise and speech with the Lombard effect.
  • Keywords
    Viterbi decoding; acoustic noise; audio coding; hidden Markov models; speech recognition; video coding; Lombard effect; adverse environments; asynchronous Viterbi decoding; enhanced speech recognition; hidden Markov model; integrated audio-visual information; lip-reading; speechreading; Automatic speech recognition; Background noise; Decoding; Degradation; Hidden Markov models; Speech processing; Speech recognition; State-space methods; Testing; Viterbi algorithm
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia Signal Processing, 2001 IEEE Fourth Workshop on
  • Conference_Location
    Cannes
  • Print_ISBN
    0-7803-7025-2
  • Type
    conf
  • DOI
    10.1109/MMSP.2001.962800
  • Filename
    962800