• DocumentCode
    2035776
  • Title

    A dynamic approach for handling data skew problems in parallel hash join computation

  • Author

    Xiaofang Zhou ; Orlowska, M.E.

  • Author_Institution
    Dept. of Comput. Sci., Queensland Univ., St. Lucia, Qld., Australia
  • Volume
    1
  • fYear
    1993
  • fDate
    19-21 Oct. 1993
  • Firstpage
    133
  • Abstract
    Data skew can result in load imbalance in parallel hash join. We introduce a new term, solid data skew, and classify data skew into plane and solid categories in this paper. The existing algorithms consider plane data skew only thus not adequate to approach solid data skew problem. A dynamic approach for handling solid data skew problem is proposed. It assigns join subtasks to processors during runtime using a method based on the estimation of execution time required for the computation.<>
  • Keywords
    distributed databases; file organisation; merging; data skew; data skew problems; database applications; database operations; distributed database systems; execution time; load imbalance; parallel database; parallel hash join computation; parallel join algorithm; Algorithm design and analysis; Computer science; Concurrent computing; Database systems; Distributed databases; Load management; Parallel processing; Relational databases; Runtime; Solids;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    TENCON '93. Proceedings. Computer, Communication, Control and Power Engineering.1993 IEEE Region 10 Conference on
  • Conference_Location
    Beijing, China
  • Print_ISBN
    0-7803-1233-3
  • Type

    conf

  • DOI
    10.1109/TENCON.1993.319946
  • Filename
    319946