DocumentCode :
1659360
Title :
3D motion in visual saliency modeling
Author :
Pengfei Wan ; Yunlong Feng ; Cheung, Gene ; Bajic, Ivan V. ; Au, Oscar C. ; Yusheng Ji
Author_Institution :
Hong Kong Univ. of Sci. & Technol., Hong Kong, China
fYear :
2013
Firstpage :
1831
Lastpage :
1835
Abstract :
Visual saliency is a probabilistic estimate of how likely a given spatial area in an image or video is to attract human visual attention relative to other areas. Bottom-up saliency models aggregate low-level image features like luminance and color contrast, flicker, 2D motion, etc. to construct a plausible saliency map. In this paper, we introduce 3D motion (object movements towards or away from the observer) into bottom-up video saliency modeling. Given availability of per-pixel depth maps, we first propose a novel algorithm to estimate 3D motion vectors (3DMVs) for arbitrarily shaped sub-blocks in texture-plus-depth videos. We then derive two feature channels from 3DMVs to be incorporated into a widely accepted bottom-up saliency model. Experiments on subjective quality of Region-of-Interest (ROI) based video coding show that our enriched saliency model with 3DMV channels is more accurate in estimating human visual attention.
Keywords :
motion estimation; video signal processing; 3D motion vector estimation; bottom up video saliency modeling; human visual attention; low level image feature; object movement; per pixel depth map; spatial area; texture plus depth video; visual saliency modeling; Computational modeling; Motion estimation; Solid modeling; Streaming media; Three-dimensional displays; Video coding; Visualization; 3D motion estimation; ROI-based video coding; visual saliency computation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6637969
Filename :
6637969
Link To Document :
بازگشت