• DocumentCode
    433120
  • Title

    Audio-visual flow - a variational approach to multimodal flow estimation

  • Author

    Hamid, Ruffay ; Bobick, Aaron ; Yezzi, Anthony

  • Author_Institution
    GVU Center, Georgia Inst. of Technol., Atlanta, GA, USA
  • Volume
    4
  • fYear
    2004
  • fDate
    24-27 Oct. 2004
  • Firstpage
    2563
  • Abstract
    Just as a motion field is associated with a moving object, an audio field can be associated with an object that behaves as a sound source. The flow field of such a sound source moving over time has not only an optical component but also an audio component; we call this audio-visual flow. In this paper we present a common structure-tensor-based variational framework for dense audio-visual flow-field estimation. The proposed scheme improves the rank of the local structure tensor by incorporating an audio information channel that is substantially uncorrelated with the complementary visual information channel. The scheme allows weights to be assigned to individual sensor modalities based on the confidence in their corresponding measurements. Results demonstrate how combining multiple modalities in the proposed framework can provide a possible solution to temporary full visual occlusions.
  • Keywords
    audio signal processing; audio-visual systems; hidden feature removal; image sensors; image sequences; probability; sensor fusion; video signal processing; audio component; audio information channel; audio-visual flow; dense audiovisual flow-field estimation; local structure tensor; moving object; multimodal flow estimation; optical component; sensor modality; sound source; visual occlusion; Image motion analysis; Layout; Motion estimation; Optical devices; Optical noise; Optical sensors; Optical variables control; Signal analysis; Tensile stress; Videoconference;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Image Processing, 2004. ICIP '04. 2004 International Conference on
  • ISSN
    1522-4880
  • Print_ISBN
    0-7803-8554-3
  • Type

    conf

  • DOI
    10.1109/ICIP.2004.1421626
  • Filename
    1421626
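
The rank argument in the abstract can be illustrated with a minimal sketch. This is not the paper's variational method: it is a simplified, single-point least-squares analogue, and it assumes (hypothetically) that the audio channel can be treated as a scalar field with spatio-temporal derivatives `(gx, gy, gt)` on the same grid as the image. The function names `weighted_structure_tensor` and `solve_flow` and the weights are illustrative only.

```python
def weighted_structure_tensor(gradients, weights):
    """Return the 3x3 tensor J = sum_k w_k * g_k g_k^T.

    Each g_k = (gx, gy, gt) is the spatio-temporal gradient of one
    channel (assumed here: one visual, one audio); w_k is the
    confidence weight assigned to that channel.
    """
    J = [[0.0] * 3 for _ in range(3)]
    for g, w in zip(gradients, weights):
        for i in range(3):
            for j in range(3):
                J[i][j] += w * g[i] * g[j]
    return J


def solve_flow(J, eps=1e-9):
    """Solve the 2x2 normal equations taken from J for the flow (u, v).

    Minimising sum_k w_k * (gx*u + gy*v + gt)^2 gives
    [[J00, J01], [J01, J11]] @ (u, v) = -(J02, J12).
    Returns None when the spatial block is rank-deficient.
    """
    a, b, c = J[0][0], J[0][1], J[1][1]
    det = a * c - b * b
    if abs(det) < eps:
        return None  # rank < 2: flow is not uniquely determined
    rx, ry = -J[0][2], -J[1][2]
    u = (c * rx - b * ry) / det
    v = (a * ry - b * rx) / det
    return (u, v)


# Synthetic gradients consistent with a true flow of (1, 2):
# each satisfies gx*1 + gy*2 + gt = 0.
g_visual = (1.0, 0.0, -1.0)  # visual channel varies only along x
g_audio = (0.0, 1.0, -2.0)   # assumed audio channel varies only along y

# Visual channel alone: spatial block has rank 1, so no unique flow.
visual_only = solve_flow(weighted_structure_tensor([g_visual], [1.0]))

# Adding the differently oriented audio channel raises the rank to 2.
combined = solve_flow(
    weighted_structure_tensor([g_visual, g_audio], [1.0, 0.5])
)
```

In this toy setting the visual channel alone leaves the flow undetermined, while the weighted combination recovers it; lowering a channel's weight reduces its influence without changing the structure of the system.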