Multi-label Cross-Modal Retrieval

Author

Viresh Ranjan;Nikhil Rasiwasia;C. V. Jawahar

fYear

2015

Firstpage

4094

Lastpage

4102

Abstract

In this work, we address the problem of cross-modal retrieval in presence of multi-label annotations. In particular, we introduce multi-label Canonical Correlation Analysis (ml-CCA), an extension of CCA, for learning shared subspaces taking into account high level semantic information in the form of multi-label annotations. Unlike CCA, ml-CCA does not rely on explicit pairing between modalities, instead it uses the multi-label information to establish correspondences. This results in a discriminative subspace which is better suited for cross-modal retrieval tasks. We also present Fast ml-CCA, a computationally efficient version of ml-CCA, which is able to handle large scale datasets. We show the efficacy of our approach by conducting extensive cross-modal retrieval experiments on three standard benchmark datasets. The results show that the proposed approach achieves state of the art retrieval performance on the three datasets.

Keywords

"Correlation","Semantics","Multimedia communication","Benchmark testing","Computer vision","Standards","Portable computers"

Publisher

ieee

Conference_Titel

Computer Vision (ICCV), 2015 IEEE International Conference on

Electronic_ISBN

2380-7504

Type

conf

DOI

10.1109/ICCV.2015.466

Filename

7410823

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3748896