DocumentCode :
2786883
Title :
Optimized Inverted List Assignment in Distributed Search Engine Architectures
Author :
Zhang, Jiangong ; Suel, Torsten
Author_Institution :
CIS Dept., Polytech. Univ., Brooklyn, NY
fYear :
2007
fDate :
26-30 March 2007
Firstpage :
1
Lastpage :
10
Abstract :
We study efficient query processing in distributed Web search engines with global index organization. The main performance bottleneck in this case is due to the large amount of index data that is exchanged between nodes during the processing of a query, and previous work has proposed several techniques for significantly reducing this cost. We describe an approach that provides substantial additional improvement over previous techniques. In particular, we analyze search engine query traces in order to optimize the assignment of index data to the nodes in the system, such that terms frequently occurring together in queries are also often collocated on the same node. Our experiments show that in return for a modest factor increase in storage space, we can achieve a reduction in communication cost of an order of magnitude over the previous best techniques.
Keywords :
data structures; peer-to-peer computing; query processing; search engines; distributed Web search engine architecture; index organization; inverted list assignment optimization; peer-to-peer architecture; query processing; Bandwidth; Computational Intelligence Society; Costs; Information retrieval; Large-scale systems; Peer to peer computing; Query processing; Search engines; Service oriented architecture; Web search;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International
Conference_Location :
Long Beach, CA
Print_ISBN :
1-4244-0910-1
Electronic_ISBN :
1-4244-0910-1
Type :
conf
DOI :
10.1109/IPDPS.2007.370231
Filename :
4227959