DocumentCode
2718298
Title
Leveraging category-level labels for instance-level image retrieval
Author
Gordoa, Albert ; Rodriguez-Serrano, Jose A ; Perronnin, Florent ; Valveny, Ernest
Author_Institution
Textual & Visual Pattern Anal., Xerox Res. Center Eur., Meylan, France
fYear
2012
fDate
16-21 June 2012
Firstpage
3045
Lastpage
3052
Abstract
In this article, we focus on the problem of large-scale instance-level image retrieval. For efficiency reasons, it is common to represent an image by a fixed-length descriptor which is subsequently encoded into a small number of bits. We note that most encoding techniques include an unsupervised dimensionality reduction step. Our goal in this work is to learn a better subspace in a supervised manner. We especially raise the following question: "can category-level labels be used to learn such a subspace?" To answer this question, we experiment with four learning techniques: the first one is based on a metric learning framework, the second one on attribute representations, the third one on Canonical Correlation Analysis (CCA) and the fourth one on Joint Subspace and Classifier Learning (JSCL). While the first three approaches have been applied in the past to the image retrieval problem, we believe we are the first to show the usefulness of JSCL in this context. In our experiments, we use ImageNet as a source of category-level labels and report retrieval results on two standard dataseis: INRIA Holidays and the University of Kentucky benchmark. Our experimental study shows that metric learning and attributes do not lead to any significant improvement in retrieval accuracy, as opposed to CCA and JSCL. As an example, we report on Holidays an increase in accuracy from 39.3% to 48.6% with 32-dimensional representations. Overall JSCL is shown to yield the best results.
Keywords
correlation methods; encoding; image classification; image representation; image retrieval; learning (artificial intelligence); CCA; INRIA Holidays; ImageNet; JSCL; University of Kentucky benchmark; attribute representations; canonical correlation analysis; category-level labels; encoding techniques; fixed-length descriptor; image representation; joint subspace and classifier learning; large-scale instance-level image retrieval; metric learning framework; subspace learning; unsupervised dimensionality reduction step; Accuracy; Correlation; Encoding; Image retrieval; Measurement; Principal component analysis; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
Conference_Location
Providence, RI
ISSN
1063-6919
Print_ISBN
978-1-4673-1226-4
Electronic_ISBN
1063-6919
Type
conf
DOI
10.1109/CVPR.2012.6248035
Filename
6248035
Link To Document