DocumentCode :
3057690
Title :
Load Balancing Distributed Inverted Files: Query Ranking
Author :
Gomez-Pantoja, Carlos ; Marin, Mauricio
Author_Institution :
Univ. of Chile, Santiago
fYear :
2008
fDate :
13-15 Feb. 2008
Firstpage :
329
Lastpage :
333
Abstract :
Search engines use inverted files as index data structures to speed up the solution of user queries. The index is distributed on a set of processors forming a cluster of computers and queries are received by a broker machine and scheduled for solution in the cluster. The broker must use a scheduling algorithm to assign queries to processors since the computations associated with the ranking of documents that form part of the solutions to queries can take a significant fraction of the total running time. The cost of this task can be highly variable and depends on the particular user preferences for words when formulating queries in a given period of time. Thus the scheduling algorithm must be able to cope efficiently with a highly dynamic and very large amount of jobs being assigned in an on-line manner to the processors. In this paper we evaluate a number of scheduling algorithms proposed in the literature in the context of scheduling queries on a search engine.
Keywords :
data structures; distributed processing; query processing; scheduling; search engines; distributed inverted files; index data structures; load balancing; queries scheduling; query ranking; scheduling algorithm; search engines; Delay; Distributed computing; Load management; Parallel processing; Processor scheduling; Query processing; Scheduling algorithm; Search engines; Traffic control; Yarn; information retrieval; parallel computing; scheduling algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel, Distributed and Network-Based Processing, 2008. PDP 2008. 16th Euromicro Conference on
Conference_Location :
Toulouse
ISSN :
1066-6192
Print_ISBN :
978-0-7695-3089-5
Type :
conf
DOI :
10.1109/PDP.2008.93
Filename :
4457140
Link To Document :
بازگشت