DocumentCode :
113166
Title :
Retargeting Semantically-Rich Photos
Author :
Luming Zhang ; Meng Wang ; Liqiang Nie ; Liang Hong ; Yong Rui ; Qi Tian
Author_Institution :
Dept. of Electr. Eng. & Inf. Syst., Hefei Univ. of Technol., Hefei, China
Volume :
17
Issue :
9
fYear :
2015
fDate :
Sept. 2015
Firstpage :
1538
Lastpage :
1549
Abstract :
Semantically-rich photos contain a rich variety of semantic objects (e.g., pedestrians and bicycles). Retargeting these photos is a challenging task since each semantic object has fixed geometric characteristics. Shrinking these objects simultaneously during retargeting is prone to distortion. In this paper, we propose to retarget semantically-rich photos by detecting photo semantics from image tags, which are predicted by a multi-label SVM. The key technique is a generative model termed latent stability discovery (LSD). It can robustly localize various semantic objects in a photo by making use of the predicted noisy image tags. Based on LSD, a feature fusion algorithm is proposed to detect salient regions at both the low-level and high-level. These salient regions are linked into a path sequentially to simulate human visual perception . Finally, we learn the prior distribution of such paths from aesthetically pleasing training photos. The prior enforces the path of a retargeted photo to be maximally similar to those from the training photos. In the experiment, we collect 217 1600 ×1200 photos, each containing over seven salient objects. Comprehensive user studies demonstrate the competitiveness of our method.
Keywords :
object detection; support vector machines; LSD model; human visual perception; latent stability discovery; multilabel SVM; object shrinking; photo semantics detection; salient region detection; semantic object; semantically-rich photos; support vector machines; Adaptation models; Computational modeling; Distortion; Feature extraction; Noise measurement; Semantics; Visualization; Human perception; image tags; noisy; retargeting; semantically-rich; shrink;
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
ieee
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2015.2451954
Filename :
7145436
Link To Document :
بازگشت