• DocumentCode
    2292829
  • Title

    Landmark classification in large-scale image collections

  • Author

    Li, Yunpeng ; Crandall, David J. ; Huttenlocher, Daniel P.

  • Author_Institution
    Dept. of Comput. Sci., Cornell Univ., Ithaca, NY, USA
  • fYear
    2009
  • fDate
    Sept. 29 2009-Oct. 2 2009
  • Firstpage
    1957
  • Lastpage
    1964
  • Abstract
    With the rise of photo-sharing websites such as Facebook and Flickr has come dramatic growth in the number of photographs online. Recent research in object recognition has used such sites as a source of image data, but the test images have been selected and labeled by hand, yielding relatively small validation sets. In this paper we study image classification on a much larger dataset of 30 million images, including nearly 2 million of which have been labeled into one of 500 categories. The dataset and categories are formed automatically from geotagged photos from Flickr, by looking for peaks in the spatial geotag distribution corresponding to frequently-photographed landmarks. We learn models for these landmarks with a multiclass support vector machine, using vector-quantized interest point descriptors as features. We also explore the non-visual information available on modern photo-sharing sites, showing that using textual tags and temporal constraints leads to significant improvements in classification rate. We find that in some cases image features alone yield comparable classification accuracy to using text tags as well as to the performance of human observers.
  • Keywords
    Web sites; image classification; object recognition; support vector machines; image classification; landmark classification; large-scale image collections; multiclass support vector machine; object recognition; photo-sharing Web sites; vector-quantized interest point descriptors; Computer science; Computer vision; Facebook; Image classification; Internet; Large-scale systems; Object recognition; Support vector machine classification; Support vector machines; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision, 2009 IEEE 12th International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1550-5499
  • Print_ISBN
    978-1-4244-4420-5
  • Electronic_ISBN
    1550-5499
  • Type

    conf

  • DOI
    10.1109/ICCV.2009.5459432
  • Filename
    5459432