• DocumentCode
    113166
  • Title

    Retargeting Semantically-Rich Photos

  • Author

    Luming Zhang ; Meng Wang ; Liqiang Nie ; Liang Hong ; Yong Rui ; Qi Tian

  • Author_Institution
    Dept. of Electr. Eng. & Inf. Syst., Hefei Univ. of Technol., Hefei, China
  • Volume
    17
  • Issue
    9
  • fYear
    2015
  • fDate
    Sept. 2015
  • Firstpage
    1538
  • Lastpage
    1549
  • Abstract
    Semantically-rich photos contain a rich variety of semantic objects (e.g., pedestrians and bicycles). Retargeting these photos is a challenging task since each semantic object has fixed geometric characteristics. Shrinking these objects simultaneously during retargeting is prone to distortion. In this paper, we propose to retarget semantically-rich photos by detecting photo semantics from image tags, which are predicted by a multi-label SVM. The key technique is a generative model termed latent stability discovery (LSD). It can robustly localize various semantic objects in a photo by making use of the predicted noisy image tags. Based on LSD, a feature fusion algorithm is proposed to detect salient regions at both the low-level and high-level. These salient regions are linked into a path sequentially to simulate human visual perception . Finally, we learn the prior distribution of such paths from aesthetically pleasing training photos. The prior enforces the path of a retargeted photo to be maximally similar to those from the training photos. In the experiment, we collect 217 1600 ×1200 photos, each containing over seven salient objects. Comprehensive user studies demonstrate the competitiveness of our method.
  • Keywords
    object detection; support vector machines; LSD model; human visual perception; latent stability discovery; multilabel SVM; object shrinking; photo semantics detection; salient region detection; semantic object; semantically-rich photos; support vector machines; Adaptation models; Computational modeling; Distortion; Feature extraction; Noise measurement; Semantics; Visualization; Human perception; image tags; noisy; retargeting; semantically-rich; shrink;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2015.2451954
  • Filename
    7145436