• DocumentCode
    3406540
  • Title

    Reading between the lines: Object localization using implicit cues from image tags

  • Author

    Hwang, Sung Ju ; Grauman, Kristen

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Texas at Austin, Austin, TX, USA
  • fYear
    2010
  • fDate
    13-18 June 2010
  • Firstpage
    2971
  • Lastpage
    2978
  • Abstract
    Current uses of tagged images typically exploit only the most explicit information: the link between the nouns named and the objects present somewhere in the image. We propose to leverage “unspoken” cues that rest within an ordered list of image tags so as to improve object localization. We define three novel implicit features from an image´s tags - the relative prominence of each object as signified by its order of mention, the scale constraints implied by unnamed objects, and the loose spatial links hinted by the proximity of names on the list. By learning a conditional density over the localization parameters (position and scale) given these cues, we show how to improve both accuracy and efficiency when detecting the tagged objects. We validate our approach with 25 object categories from the PASCAL VOC and LabelMe datasets, and demonstrate its effectiveness relative to both traditional sliding windows as well as a visual context baseline.
  • Keywords
    feature extraction; image processing; LabelMe dataset; conditional density; image tags; implicit cues; object localization; sliding window; spatial links; tagged image; visual context baseline; Computer science; Computer vision; Layout; Object detection; Object recognition; Organizing; Pixel; Social network services; System testing; Web services;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4244-6984-0
  • Type

    conf

  • DOI
    10.1109/CVPR.2010.5540043
  • Filename
    5540043