DocumentCode :
1398576
Title :
Removing Label Ambiguity in Learning-Based Visual Saliency Estimation
Author :
Li, Jia ; Xu, Dong ; Gao, Wen
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
Volume :
21
Issue :
4
fYear :
2012
fDate :
4/1/2012 12:00:00 AM
Firstpage :
1513
Lastpage :
1525
Abstract :
Visual saliency is a useful clue to depict visually important image/video contents in many multimedia applications. In visual saliency estimation, a feasible solution is to learn a “feature-saliency” mapping model from the user data obtained by manually labeling activities or eye-tracking devices. However, label ambiguities may also arise due to the inaccurate and inadequate user data. To process the noisy training data, we propose a multi-instance learning to rank approach for visual saliency estimation. In our approach, the correlations between various image patches are incorporated into an ordinal regression framework. By iteratively refining a ranking model and relabeling the image patches with respect to their mutual correlations, the label ambiguities can be effectively removed from the training data. Consequently, visual saliency can be effectively estimated by the ranking model, which can pop out real targets and suppress real distractors. Extensive experiments on two public image data sets show that our approach outperforms 11 state-of-the-art methods remarkably in visual saliency estimation.
Keywords :
content-based retrieval; estimation theory; image processing; feature saliency mapping; image/video contents; label ambiguity removal; learning based visual saliency estimation; multimedia applications; user data; Correlation; Data models; Estimation; Feature extraction; Training; Training data; Visualization; Label ambiguity; learning to rank; multi-instance learning (MIL); visual saliency; Algorithms; Artifacts; Artificial Intelligence; Documentation; Humans; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Pattern Recognition, Automated; Visual Perception;
fLanguage :
English
Journal_Title :
Image Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1057-7149
Type :
jour
DOI :
10.1109/TIP.2011.2179665
Filename :
6104149
Link To Document :
بازگشت