• DocumentCode
    258676
  • Title

    Visual attention modeling for 3D video using neural networks

  • Author

    Iatsun, Iana ; Larabi, Mohamed-Chaker ; Fernandez-Maloigne, Christine

  • Author_Institution
    XLIM-SIC Lab., Univ. of Poitiers, Poitiers, France
  • fYear
    2014
  • fDate
    9-10 Dec. 2014
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Visual attention is one of the most important mechanisms in the human visual perception. Recently, its modeling becomes a principal requirement for the optimization of the image processing systems. Numerous algorithms have already been designed for 2D saliency prediction. However, only few works can be found for 3D content. In this study, we propose a saliency model for stereoscopic 3D video. This algorithm extracts information from three dimensions of content, i.e. spatial, temporal and depth. This model benefits from the properties of interest points to be close to human fixations in order to build spatial salient features. Besides, as the perception of depth relies strongly on monocular cues, our model extracts the depth salient features using the pictorial depth sources. Since weights for fusion strategy are often selected in ad-hoc manner, in this work, we suggest to use a machine learning approach. The used artificial Neural Network allows to define adaptive weights based on the eye-tracking data. The results of the proposed algorithm are tested versus ground-truth information using the state-of-the-art techniques.
  • Keywords
    feature extraction; image fusion; learning (artificial intelligence); neural nets; spatiotemporal phenomena; stereo image processing; video signal processing; visual perception; 2D saliency prediction; ad-hoc method; adaptive weight selection; artificial neural network; depth dimension; depth perception; depth salient features; eye-tracking data; fusion strategy; human fixations; human visual perception; image processing system optimization; information extraction; interest point properties; machine learning approach; monocular cues; pictorial depth sources; saliency model; spatial dimension; spatial salient features; stereoscopic 3D video; temporal dimension; visual attention modeling; Computational modeling; Equations; Feature extraction; Mathematical model; Solid modeling; Three-dimensional displays; Visualization; Saliency; interest points; monocular cues; neural networks; stereoscopic content; visual attention;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    3D Imaging (IC3D), 2014 International Conference on
  • Conference_Location
    Liege
  • Type

    conf

  • DOI
    10.1109/IC3D.2014.7032602
  • Filename
    7032602