Landmark classification in large-scale image collections

Author

Li, Yunpeng ; Crandall, David J. ; Huttenlocher, Daniel P.

Author_Institution

Dept. of Comput. Sci., Cornell Univ., Ithaca, NY, USA

fYear

2009

fDate

Sept. 29 2009-Oct. 2 2009

Firstpage

1957

Lastpage

1964

Abstract

With the rise of photo-sharing websites such as Facebook and Flickr has come dramatic growth in the number of photographs online. Recent research in object recognition has used such sites as a source of image data, but the test images have been selected and labeled by hand, yielding relatively small validation sets. In this paper we study image classification on a much larger dataset of 30 million images, including nearly 2 million of which have been labeled into one of 500 categories. The dataset and categories are formed automatically from geotagged photos from Flickr, by looking for peaks in the spatial geotag distribution corresponding to frequently-photographed landmarks. We learn models for these landmarks with a multiclass support vector machine, using vector-quantized interest point descriptors as features. We also explore the non-visual information available on modern photo-sharing sites, showing that using textual tags and temporal constraints leads to significant improvements in classification rate. We find that in some cases image features alone yield comparable classification accuracy to using text tags as well as to the performance of human observers.

Keywords

Web sites; image classification; object recognition; support vector machines; image classification; landmark classification; large-scale image collections; multiclass support vector machine; object recognition; photo-sharing Web sites; vector-quantized interest point descriptors; Computer science; Computer vision; Facebook; Image classification; Internet; Large-scale systems; Object recognition; Support vector machine classification; Support vector machines; Testing;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Vision, 2009 IEEE 12th International Conference on

Conference_Location

Kyoto

ISSN

1550-5499

Print_ISBN

978-1-4244-4420-5

Electronic_ISBN

1550-5499

Type

conf

DOI

10.1109/ICCV.2009.5459432

Filename

5459432