• DocumentCode
    3294663
  • Title

    Spatiotemporal Saliency Detection via Sparse Representation

  • Author

    Ren, Zhixiang ; Gao, Shenghua ; Rajan, Deepu ; Chia, Liang-Tien ; Huang, Yun

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • fYear
    2012
  • fDate
    9-13 July 2012
  • Firstpage
    158
  • Lastpage
    163
  • Abstract
    Multimedia applications like retrieval, copy detection etc. can gain from saliency detection, which is essentially a method to identify areas in images and videos that capture the attention of the human visual system. In this paper, we propose a new spatiotemporal saliency framework for videos based on sparse representation. For temporal saliency, we model the movement of the target patch as a reconstruction process, and the overlapping patches in neighboring frames are used to reconstruct the target patch. The learned coefficients encode the positions of the matched patches, which are able to represent the motion trajectory of the target patch. We also introduce a smoothing term into our sparse coding framework to learn coherent motion trajectories. Based on the psychological findings that abrupt stimulus could cause a rapid and involuntary deployment of attention, our temporal model combines the reconstruction error, sparsity regularizer, and local trajectory contrast to measure the motion saliency. For spatial saliency, a similar sparse reconstruction process is adopted to capture the regions with high center-surround contrast. Finally, the temporal saliency and spatial saliency are combined by agreement to favor the salient regions with high confidence. Experimental results on a human fixation video dataset show our method achieved the best performance over five state-of-the-art approaches.
  • Keywords
    computer vision; error analysis; image coding; image matching; image motion analysis; image reconstruction; image representation; multimedia computing; object detection; smoothing methods; spatiotemporal phenomena; video signal processing; coherent motion trajectory; human visual system; image reconstruction; image smoothing; motion saliency measurement; motion trajectory representation; multimedia computing; overlapped patch; reconstruction error; sparse coding; sparse reconstruction process; sparse representation; sparsity regularizer; spatiotemporal saliency detection; target patch matching; target patch movement model; video saliency detection; Computational modeling; Dictionaries; Encoding; Humans; Spatiotemporal phenomena; Trajectory; Videos; Motion trajectory; Sparse coding; Spatiotemporal saliency detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo (ICME), 2012 IEEE International Conference on
  • Conference_Location
    Melbourne, VIC
  • ISSN
    1945-7871
  • Print_ISBN
    978-1-4673-1659-0
  • Type

    conf

  • DOI
    10.1109/ICME.2012.173
  • Filename
    6298391