DocumentCode
2988225
Title
An empirical study of fusion operators for multimodal image retrieval
Author
Csurka, Gabriela ; Clinchant, Stéphane
Author_Institution
Xerox Res. Center Eur., Meylan, France
fYear
2012
fDate
27-29 June 2012
Firstpage
1
Lastpage
6
Abstract
In this paper we propose an empirical study of late fusion operators for multimodal image retrieval. Therefore, we consider two experts, one based on textual and one on visual similarities between documents and study the possibilities to go beyond simple score averaging. The main idea is to exploit the correlation between the two experts by encoding explicitly or implicitly an "and" and an "or" operator in an efficient way. We show through several experiments that the operators that combine both of these two aspects generally outperform the ones that look only to one of them. Based on this observation we propose several generalized version of most classical fusion operators and compare them using ImageClef benchmark datasets both in an unsupervised and in a supervised framework.
Keywords
document handling; image fusion; image retrieval; ImageClef benchmark datasets; and operator; classical fusion operators; documents; multimodal image retrieval; or operator; score averaging; textual similarity; visual similarity; Electronic publishing; Encyclopedias; Harmonic analysis; Internet; Semantics; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Content-Based Multimedia Indexing (CBMI), 2012 10th International Workshop on
Conference_Location
Annecy
ISSN
1949-3983
Print_ISBN
978-1-4673-2368-0
Electronic_ISBN
1949-3983
Type
conf
DOI
10.1109/CBMI.2012.6269843
Filename
6269843
Link To Document