Title :
A dynamic approach for handling data skew problems in parallel hash join computation
Author :
Xiaofang Zhou ; Orlowska, M.E.
Author_Institution :
Dept. of Comput. Sci., Queensland Univ., St. Lucia, Qld., Australia
Abstract :
Data skew can result in load imbalance in parallel hash join. We introduce a new term, solid data skew, and classify data skew into plane and solid categories in this paper. The existing algorithms consider plane data skew only thus not adequate to approach solid data skew problem. A dynamic approach for handling solid data skew problem is proposed. It assigns join subtasks to processors during runtime using a method based on the estimation of execution time required for the computation.<>
Keywords :
distributed databases; file organisation; merging; data skew; data skew problems; database applications; database operations; distributed database systems; execution time; load imbalance; parallel database; parallel hash join computation; parallel join algorithm; Algorithm design and analysis; Computer science; Concurrent computing; Database systems; Distributed databases; Load management; Parallel processing; Relational databases; Runtime; Solids;
Conference_Titel :
TENCON '93. Proceedings. Computer, Communication, Control and Power Engineering.1993 IEEE Region 10 Conference on
Conference_Location :
Beijing, China
Print_ISBN :
0-7803-1233-3
DOI :
10.1109/TENCON.1993.319946