DocumentCode
2539026
Title
Applying statistical principles to data fusion in information retrieval
Author
Wu, Shengli ; Bi, Yaxin ; Mcclean, Sally
Author_Institution
Univ. of Ulster, Coleraine
fYear
2007
fDate
7-10 Oct. 2007
Firstpage
313
Lastpage
319
Abstract
Data fusion in information retrieval has been investigated by many researchers and quite a few data fusion methods have been proposed. However, their impact on effectiveness has not been well understood. In this paper, we apply statistical principles to data fusion and present a statistical data fusion model, which specifies the algorithm for fusion and conditions to be satisfied. The statistical model can be used as a guideline for data fusion methods. Based on this analysis, we compare CombSum and CombMNZ, which are the two best-known data fusion methods. We explain why sometimes CombMNZ does outperform Comb- Sum and what can be done to make CombSum more effective. Experimental results with TREC data are reported to support the conclusion that our enhancements to the algorithm improve effectiveness.
Keywords
information retrieval; sensor fusion; statistical analysis; data fusion; information retrieval; statistical principle; Bayesian methods; Bismuth; Correlation; Euclidean distance; Guidelines; Information retrieval; Mathematics; Merging; Metasearch; Sampling methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man and Cybernetics, 2007. ISIC. IEEE International Conference on
Conference_Location
Montreal, Que.
Print_ISBN
978-1-4244-0990-7
Electronic_ISBN
978-1-4244-0991-4
Type
conf
DOI
10.1109/ICSMC.2007.4413590
Filename
4413590
Link To Document