DocumentCode :
1813284
Title :
Load balancing in pipelined processing of multi-join queries
Author :
Lu, Hongjun ; Kian-Lee Tan ; Lee, Chiang
Author_Institution :
Dept. of Inf. Syst. & Comput. Sci., Nat. Univ. of Singapore, Singapore
fYear :
1994
fDate :
19-22 Dec 1994
Firstpage :
670
Lastpage :
675
Abstract :
Looks at how to effectively exploit pipelining for multi-join queries in shared-nothing systems. A multi-join query can be processed using an iterative approach. In each iteration, several relations are selected and are joined in a pipelined fashion. However, algorithms that are based on this approach have traditionally assumed that the relations are uniformly distributed or only slightly skewed. When this assumption is relaxed, i.e. when the data is skewed, some nodes may be assigned a larger amount of data than can fit into their memories. As such, pipelining cannot be effectively exploited, and performance may degenerate drastically. We propose four skew handling techniques to deal with data skew for multi-join queries. The results of a performance study show that a hybrid technique is superior in most cases
Keywords :
distributed databases; pipeline processing; query processing; relational databases; resource allocation; software performance evaluation; data skew; hybrid technique; iterative approach; load balancing; multi-join queries; performance degeneration; pipelined processing; relation joining; shared-nothing systems; Communication networks; Computer science; Database systems; Information systems; Iterative algorithms; Iterative methods; Load management; Pipeline processing; System performance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Systems, 1994. International Conference on
Conference_Location :
Hsinchu
Print_ISBN :
0-8186-6555-6
Type :
conf
DOI :
10.1109/ICPADS.1994.590427
Filename :
590427
Link To Document :
بازگشت