DocumentCode :
2718298
Title :
Leveraging category-level labels for instance-level image retrieval
Author :
Gordoa, Albert ; Rodriguez-Serrano, Jose A ; Perronnin, Florent ; Valveny, Ernest
Author_Institution :
Textual & Visual Pattern Anal., Xerox Res. Center Eur., Meylan, France
fYear :
2012
fDate :
16-21 June 2012
Firstpage :
3045
Lastpage :
3052
Abstract :
In this article, we focus on the problem of large-scale instance-level image retrieval. For efficiency reasons, it is common to represent an image by a fixed-length descriptor which is subsequently encoded into a small number of bits. We note that most encoding techniques include an unsupervised dimensionality reduction step. Our goal in this work is to learn a better subspace in a supervised manner. We especially raise the following question: "can category-level labels be used to learn such a subspace?" To answer this question, we experiment with four learning techniques: the first one is based on a metric learning framework, the second one on attribute representations, the third one on Canonical Correlation Analysis (CCA) and the fourth one on Joint Subspace and Classifier Learning (JSCL). While the first three approaches have been applied in the past to the image retrieval problem, we believe we are the first to show the usefulness of JSCL in this context. In our experiments, we use ImageNet as a source of category-level labels and report retrieval results on two standard dataseis: INRIA Holidays and the University of Kentucky benchmark. Our experimental study shows that metric learning and attributes do not lead to any significant improvement in retrieval accuracy, as opposed to CCA and JSCL. As an example, we report on Holidays an increase in accuracy from 39.3% to 48.6% with 32-dimensional representations. Overall JSCL is shown to yield the best results.
Keywords :
correlation methods; encoding; image classification; image representation; image retrieval; learning (artificial intelligence); CCA; INRIA Holidays; ImageNet; JSCL; University of Kentucky benchmark; attribute representations; canonical correlation analysis; category-level labels; encoding techniques; fixed-length descriptor; image representation; joint subspace and classifier learning; large-scale instance-level image retrieval; metric learning framework; subspace learning; unsupervised dimensionality reduction step; Accuracy; Correlation; Encoding; Image retrieval; Measurement; Principal component analysis; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
Conference_Location :
Providence, RI
ISSN :
1063-6919
Print_ISBN :
978-1-4673-1226-4
Electronic_ISBN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2012.6248035
Filename :
6248035
Link To Document :
بازگشت