DocumentCode
1813284
Title
Load balancing in pipelined processing of multi-join queries
Author
Lu, Hongjun ; Kian-Lee Tan ; Lee, Chiang
Author_Institution
Dept. of Inf. Syst. & Comput. Sci., Nat. Univ. of Singapore, Singapore
fYear
1994
fDate
19-22 Dec 1994
Firstpage
670
Lastpage
675
Abstract
Looks at how to effectively exploit pipelining for multi-join queries in shared-nothing systems. A multi-join query can be processed using an iterative approach. In each iteration, several relations are selected and are joined in a pipelined fashion. However, algorithms that are based on this approach have traditionally assumed that the relations are uniformly distributed or only slightly skewed. When this assumption is relaxed, i.e. when the data is skewed, some nodes may be assigned a larger amount of data than can fit into their memories. As such, pipelining cannot be effectively exploited, and performance may degenerate drastically. We propose four skew handling techniques to deal with data skew for multi-join queries. The results of a performance study show that a hybrid technique is superior in most cases
Keywords
distributed databases; pipeline processing; query processing; relational databases; resource allocation; software performance evaluation; data skew; hybrid technique; iterative approach; load balancing; multi-join queries; performance degeneration; pipelined processing; relation joining; shared-nothing systems; Communication networks; Computer science; Database systems; Information systems; Iterative algorithms; Iterative methods; Load management; Pipeline processing; System performance;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Systems, 1994. International Conference on
Conference_Location
Hsinchu
Print_ISBN
0-8186-6555-6
Type
conf
DOI
10.1109/ICPADS.1994.590427
Filename
590427
Link To Document