Title :
Audio-visual saliency map: Overview, basic models and hardware implementation
Author :
Ramenahalli, Sudarshan ; Mendat, Daniel R. ; Dura-Bernal, Salvador ; Culurciello, Eugenio ; Nieburt, Ernst ; Andreou, A.G.
Author_Institution :
Dept. of Electr. & Comput. Eng., Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
In this paper we provide an overview of audiovisual saliency map models. In the simplest model, the location of auditory source is modeled as a Gaussian and use different methods of combining the auditory and visual information. We then provide experimental results with applications of simple audio-visual integration models for cognitive scene analysis. We validate the simple audio-visual saliency models with a hardware convolutional network architecture and real data recorded from moving audio-visual objects. The latter system was developed under Torch language by extending the attention.lua (code) and attention.ui (GUI) files that implement Culurciello´s visual attention model.
Keywords :
audio-visual systems; graphical user interfaces; telecommunication computing; Culurciello visual attention model; Gaussian model; Torch language; attention.lua file; attention.ui file; audiovisual integration model; audiovisual saliency map model; auditory information; auditory source location; cognitive scene analysis; hardware convolutional network architecture; moving audiovisual objects; visual information; Graphical user interfaces; Hardware; Robustness; Speech; Speech recognition; Visualization; Welding;
Conference_Titel :
Information Sciences and Systems (CISS), 2013 47th Annual Conference on
Conference_Location :
Baltimore, MD
Print_ISBN :
978-1-4673-5237-6
Electronic_ISBN :
978-1-4673-5238-3
DOI :
10.1109/CISS.2013.6552285