DocumentCode :
729707
Title :
Coupled dictionary learning and feature mapping for cross-modal retrieval
Author :
Xing Xu ; Shimada, Atsushi ; Taniguchi, Rin-ichiro ; Li He
Author_Institution :
Kyushu Univ., Fukuoka, Japan
fYear :
2015
fDate :
June 29 2015-July 3 2015
Firstpage :
1
Lastpage :
6
Abstract :
In this paper, we investigate the problem of modeling images and associated text for cross-modal retrieval tasks such as text-to-image search and image-to-text search. To make the data from image and text modalities comparable, previous cross-modal retrieval methods directly learn two projection matrices to map the raw features of the two modalities into a common subspace, in which cross-modal data matching can be performed. However, the different feature representations and correlation structures of different modalities inhibit these methods from efficiently modeling the relationships across modalities through a common subspace. To handle the diversities of different modalities, we first leverage the coupled dictionary learning method to generate homogeneous sparse representations for different modalities by associating and jointly updating their dictionaries. We then use a coupled feature mapping scheme to project the derived sparse representations from different modalities into a common subspace in which cross-modal retrieval can be performed. Experiments on a variety of cross-modal retrieval tasks demonstrate that the proposed method outperforms the state-of-the-art approaches.
Keywords :
image matching; image representation; image retrieval; coupled dictionary learning; cross-modal data matching; cross-modal retrieval; feature mapping scheme; image-to-text search; sparse representation; text-to-image search; Dictionaries; Electronic publishing; Internet; Iterative methods; Semantics; Sparse matrices; Cross-modal retrieval; coupled dictionary learning; feature mapping; image annotation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo (ICME), 2015 IEEE International Conference on
Conference_Location :
Turin
Type :
conf
DOI :
10.1109/ICME.2015.7177396
Filename :
7177396
Link To Document :
بازگشت