DocumentCode
2035776
Title
A dynamic approach for handling data skew problems in parallel hash join computation
Author
Xiaofang Zhou ; Orlowska, M.E.
Author_Institution
Dept. of Comput. Sci., Queensland Univ., St. Lucia, Qld., Australia
Volume
1
fYear
1993
fDate
19-21 Oct. 1993
Firstpage
133
Abstract
Data skew can cause load imbalance in parallel hash joins. This paper introduces a new term, solid data skew, and classifies data skew into plane and solid categories. Existing algorithms consider only plane data skew and are therefore inadequate for the solid data skew problem. A dynamic approach for handling the solid data skew problem is proposed. It assigns join subtasks to processors at runtime, using a method based on the estimated execution time required for the computation.
Keywords
distributed databases; file organisation; merging; data skew; data skew problems; database applications; database operations; distributed database systems; execution time; load imbalance; parallel database; parallel hash join computation; parallel join algorithm; Algorithm design and analysis; Computer science; Concurrent computing; Database systems; Distributed databases; Load management; Parallel processing; Relational databases; Runtime; Solids;
fLanguage
English
Publisher
IEEE
Conference_Titel
TENCON '93. Proceedings. Computer, Communication, Control and Power Engineering. 1993 IEEE Region 10 Conference on
Conference_Location
Beijing, China
Print_ISBN
0-7803-1233-3
Type
conf
DOI
10.1109/TENCON.1993.319946
Filename
319946