• DocumentCode
    81984
  • Title

    Fusion of Multichannel Local and Global Structural Cues for Photo Aesthetics Evaluation

  • Author

    Luming Zhang ; Yue Gao ; Zimmermann, Raphael ; Qi Tian ; Xuelong Li

  • Author_Institution
    Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore
  • Volume
    23
  • Issue
    3
  • fYear
    2014
  • fDate
    Mar-14
  • Firstpage
    1419
  • Lastpage
    1429
  • Abstract
    Photo aesthetic quality evaluation is a fundamental yet under addressed task in computer vision and image processing fields. Conventional approaches are frustrated by the following two drawbacks. First, both the local and global spatial arrangements of image regions play an important role in photo aesthetics. However, existing rules, e.g., visual balance, heuristically define which spatial distribution among the salient regions of a photo is aesthetically pleasing. Second, it is difficult to adjust visual cues from multiple channels automatically in photo aesthetics assessment. To solve these problems, we propose a new photo aesthetics evaluation framework, focusing on learning the image descriptors that characterize local and global structural aesthetics from multiple visual channels. In particular, to describe the spatial structure of the image local regions, we construct graphlets small-sized connected graphs by connecting spatially adjacent atomic regions. Since spatially adjacent graphlets distribute closely in their feature space, we project them onto a manifold and subsequently propose an embedding algorithm. The embedding algorithm encodes the photo global spatial layout into graphlets. Simultaneously, the importance of graphlets from multiple visual channels are dynamically adjusted. Finally, these post-embedding graphlets are integrated for photo aesthetics evaluation using a probabilistic model. Experimental results show that: 1) the visualized graphlets explicitly capture the aesthetically arranged atomic regions; 2) the proposed approach generalizes and improves four prominent aesthetic rules; and 3) our approach significantly outperforms state-of-the-art algorithms in photo aesthetics prediction.
  • Keywords
    computer vision; graph theory; image coding; image fusion; probability; adjacent atomic regions; computer vision; embedding algorithm; global spatial arrangement; global structural aesthetics; graphlets small-sized connected graphs; image descriptors; image processing fields; local spatial arrangement; local structural aesthetics; multichannel global structural cue fusion; multichannel local cue fusion; photo aesthetic quality evaluation; photo aesthetics assessment; photo aesthetics evaluation framework; photo aesthetics prediction; photo global spatial layout encoding; post-embedding graphlets; probabilistic model; spatial structure; spatially adjacent graphlets; visual channels; visual cues; visualized graphlets; Graphical models; Image color analysis; Layout; Probabilistic logic; Training; Vectors; Visualization; Multi-channel; aesthetic evaluation; probabilistic model; structural cues;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2014.2303650
  • Filename
    6728663