DocumentCode :
2847094
Title :
Progressive distributed top-k retrieval in peer-to-peer networks
Author :
Balke, Wolf-Tilo ; Nejdl, Wolfgang ; Siberski, Wolf ; Thaden, Uwe
Author_Institution :
California Univ., Berkeley, CA, USA
fYear :
2005
fDate :
5-8 April 2005
Firstpage :
174
Lastpage :
185
Abstract :
Query processing in traditional information management systems has moved from an exact match model to more flexible paradigms allowing cooperative retrieval by aggregating the database objects' degree of match for each different query predicate and returning the best matching objects only. In peer-to-peer systems such strategies are even more important, given the potentially large number of peers that may contribute to the results. Yet current peer-to-peer research has barely started to investigate such approaches. In this paper we discuss the benefits of best match/top-k queries in the context of distributed peer-to-peer information infrastructures and show how to extend the limited query processing in current peer-to-peer networks by allowing the distributed processing of top-k queries, while maintaining a minimum of data traffic. Relying on a super-peer backbone organized in the HyperCuP topology, we show how to use local indexes for optimizing the necessary query routing and how to process intermediate results in inner network nodes at the earliest possible point in time, cutting down the necessary data traffic within the network. Our algorithm is based on dynamically collected query statistics only; no continuous index update processes are necessary, allowing it to scale easily to large numbers of peers, as well as dynamic additions/deletions of peers. We show that our approach always delivers correct result sets and is optimal in terms of necessary object accesses and data traffic. Finally, we present simulation results for both static and dynamic network environments.
Keywords :
database indexing; peer-to-peer computing; query processing; HyperCuP topology; data traffic; database object; distributed peer-to-peer information infrastructure; index update; information management system; peer-to-peer network; progressive distributed top-k retrieval; query predicate; query routing; query statistics; Databases; Distributed processing; Information management; Information retrieval; Network topology; Peer to peer computing; Query processing; Spine; Statistics; Telecommunication traffic
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Data Engineering, 2005. ICDE 2005. Proceedings. 21st International Conference on
ISSN :
1084-4627
Print_ISBN :
0-7695-2285-8
Type :
conf
DOI :
10.1109/ICDE.2005.115
Filename :
1410118