DocumentCode :
1498693
Title :
Learning-Based Prediction of Visual Attention for Video Signals
Author :
Lee, Wen-Fu ; Huang, Tai-Hsiang ; Yeh, Su-Ling ; Chen, Homer H.
Author_Institution :
Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Volume :
20
Issue :
11
fYear :
2011
Firstpage :
3028
Lastpage :
3038
Abstract :
Visual attention, which is an important characteristic of human visual system, is a useful clue for image processing and compression applications in the real world. This paper proposes a computational scheme that adopts both low-level and high-level features to predict visual attention from video signal by machine learning. The adoption of low-level features (color, orientation, and motion) is based on the study of visual cells, and the adoption of the human face as a high-level feature is based on the study of media communications. We show that such a scheme is more robust than those using purely single low- or high-level features. Unlike conventional techniques, our scheme is able to learn the relationship between features and visual attention to avoid perceptual mismatch between the estimated salience and the actual human fixation. We also show that selecting the representative training samples according to the fixation distribution improves the efficacy of regressive training. Experimental results are shown to demonstrate the advantages of the proposed scheme.
Keywords :
data compression; feature extraction; image representation; learning (artificial intelligence); video coding; actual human fixation distribution; high-level feature; human face adoption; human visual system; image compression; image processing; learning-based prediction; low-level feature; machine learning; media communication; regressive training; representative training sample; video signal; visual attention; visual cell; Cameras; Feature extraction; Humans; Image color analysis; Pixel; Training; Visualization; Eye tracking experiment; fixation distribution; human visual system; regression; saliency map; visual attention; Adult; Algorithms; Artificial Intelligence; Female; Humans; Male; Pattern Recognition, Visual; Photic Stimulation; Video Recording; Visual Fields;
fLanguage :
English
Journal_Title :
Image Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1057-7149
Type :
jour
DOI :
10.1109/TIP.2011.2144610
Filename :
5752852
Link To Document :
بازگشت