• DocumentCode
    2539026
  • Title

    Applying statistical principles to data fusion in information retrieval

  • Author

    Wu, Shengli ; Bi, Yaxin ; Mcclean, Sally

  • Author_Institution
    Univ. of Ulster, Coleraine
  • fYear
    2007
  • fDate
    7-10 Oct. 2007
  • Firstpage
    313
  • Lastpage
    319
  • Abstract
    Data fusion in information retrieval has been investigated by many researchers and quite a few data fusion methods have been proposed. However, their impact on effectiveness has not been well understood. In this paper, we apply statistical principles to data fusion and present a statistical data fusion model, which specifies the algorithm for fusion and conditions to be satisfied. The statistical model can be used as a guideline for data fusion methods. Based on this analysis, we compare CombSum and CombMNZ, which are the two best-known data fusion methods. We explain why sometimes CombMNZ does outperform Comb- Sum and what can be done to make CombSum more effective. Experimental results with TREC data are reported to support the conclusion that our enhancements to the algorithm improve effectiveness.
  • Keywords
    information retrieval; sensor fusion; statistical analysis; data fusion; information retrieval; statistical principle; Bayesian methods; Bismuth; Correlation; Euclidean distance; Guidelines; Information retrieval; Mathematics; Merging; Metasearch; Sampling methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2007. ISIC. IEEE International Conference on
  • Conference_Location
    Montreal, Que.
  • Print_ISBN
    978-1-4244-0990-7
  • Electronic_ISBN
    978-1-4244-0991-4
  • Type

    conf

  • DOI
    10.1109/ICSMC.2007.4413590
  • Filename
    4413590