• DocumentCode
    2988225
  • Title

    An empirical study of fusion operators for multimodal image retrieval

  • Author

    Csurka, Gabriela ; Clinchant, Stéphane

  • Author_Institution
    Xerox Res. Center Eur., Meylan, France
  • fYear
    2012
  • fDate
    27-29 June 2012
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    In this paper we propose an empirical study of late fusion operators for multimodal image retrieval. Therefore, we consider two experts, one based on textual and one on visual similarities between documents and study the possibilities to go beyond simple score averaging. The main idea is to exploit the correlation between the two experts by encoding explicitly or implicitly an "and" and an "or" operator in an efficient way. We show through several experiments that the operators that combine both of these two aspects generally outperform the ones that look only to one of them. Based on this observation we propose several generalized version of most classical fusion operators and compare them using ImageClef benchmark datasets both in an unsupervised and in a supervised framework.
  • Keywords
    document handling; image fusion; image retrieval; ImageClef benchmark datasets; and operator; classical fusion operators; documents; multimodal image retrieval; or operator; score averaging; textual similarity; visual similarity; Electronic publishing; Encyclopedias; Harmonic analysis; Internet; Semantics; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Content-Based Multimedia Indexing (CBMI), 2012 10th International Workshop on
  • Conference_Location
    Annecy
  • ISSN
    1949-3983
  • Print_ISBN
    978-1-4673-2368-0
  • Electronic_ISBN
    1949-3983
  • Type

    conf

  • DOI
    10.1109/CBMI.2012.6269843
  • Filename
    6269843