• DocumentCode
    2035792
  • Title

    Blind Audiovisual Source Separation using Sparse Representations

  • Author

    Casanovas, Anna Llagostera ; Monaci, Gianluca ; Vandergheynst, Pierre

  • Author_Institution
    Ecole Polytech. Fed. de Lausanne, Lausanne
  • Volume
    3
  • fYear
    2007
  • fDate
    Sept. 16 2007-Oct. 19 2007
  • Abstract
    In this work we present a method to jointly separate active audio and visual structures on a given mixture. Blind audiovisual source separation is achieved exploiting the coherence between a video signal and a one-microphone audio track. The efficient representation of audio and video sequences allows to build relationships between correlated structures on both modalities. Video structures exhibiting strong correlations with the audio signal and that are spatially close are grouped using a robust clustering algorithm that can count and localize audiovisual sources. Using such information and exploiting audio-video correlation, audio sources are also localized and separated. To the best of our knowledge this is the first blind audiovisual source separation algorithm conceived to deal with a video sequence and the corresponding mono audio signal.
  • Keywords
    audio signal processing; blind source separation; correlation methods; image representation; image sequences; video signal processing; audio sequences; audio-video correlation; audiovisual sources; blind audiovisual source separation; one-microphone audio tracking; robust clustering algorithm; video sequences; video signal processing; video sparse representation; Blind source separation; Clustering algorithms; MONOS devices; Matching pursuit algorithms; Robustness; Signal processing algorithms; Signal representations; Source separation; Video sequences; Video signal processing; Audiovisual processing; blind source separation; sparse signal representation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing, 2007. ICIP 2007. IEEE International Conference on
  • Conference_Location
    San Antonio, TX
  • ISSN
    1522-4880
  • Print_ISBN
    978-1-4244-1437-6
  • Electronic_ISBN
    1522-4880
  • Type

    conf

  • DOI
    10.1109/ICIP.2007.4379306
  • Filename
    4379306