DocumentCode :
188955
Title :
Combining Retrieval Results for Balanced Effectiveness and Efficiency in the Big Data Search Environment
Author :
Shengli Wu ; Chunlan Huang ; Jieyu Li
Author_Institution :
Sch. of Comput. Sci. & Telecomm Eng., Jiangsu Univ., Zhenjiang, China
fYear :
2014
fDate :
11-13 Sept. 2014
Firstpage :
555
Lastpage :
560
Abstract :
In the big data age, we have to deal with tremendous amount of information, which is collected from various types of sources. For information retrieval systems, the collection of documents becomes larger and larger. For some query, an information retrieval system needs to retrieve a large number of documents as the result to the query. In reality, very often people mainly care about some top-ranked documents rather than the complete long list of documents. In such a situation, how to develop a retrieval system with desirable efficiency and effectiveness is a research problem. In this paper, we focus on the data fusion approach to information retrieval, in which each component retrieval system contributes a result and all the results are combined by a combination method. The goal of this research is to find a feasible combination method that is able to balance effectiveness and efficiency. Using 3 groups of historical runs from TREC for the experiment, we find that with the weights trained by weighted linear regression, the linear combination method can achieve good results in effectiveness and efficiency.
Keywords :
Big Data; information retrieval systems; query processing; regression analysis; sensor fusion; Big Data search environment; TREC; component retrieval system; data fusion approach; effectiveness balancing; efficiency balancing; information retrieval systems; linear combination method; query; top-ranked documents; weighted linear regression; Big data; Data integration; Linear regression; Measurement; Training; Web search; data fusion; information retrieval; linear combination; results combination; web search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Technology (CIT), 2014 IEEE International Conference on
Conference_Location :
Xi´an
Type :
conf
DOI :
10.1109/CIT.2014.137
Filename :
6984710
Link To Document :
بازگشت