• DocumentCode
    1813284
  • Title

    Load balancing in pipelined processing of multi-join queries

  • Author

    Lu, Hongjun ; Kian-Lee Tan ; Lee, Chiang

  • Author_Institution
    Dept. of Inf. Syst. & Comput. Sci., Nat. Univ. of Singapore, Singapore
  • fYear
    1994
  • fDate
    19-22 Dec 1994
  • Firstpage
    670
  • Lastpage
    675
  • Abstract
    Looks at how to effectively exploit pipelining for multi-join queries in shared-nothing systems. A multi-join query can be processed using an iterative approach. In each iteration, several relations are selected and are joined in a pipelined fashion. However, algorithms that are based on this approach have traditionally assumed that the relations are uniformly distributed or only slightly skewed. When this assumption is relaxed, i.e. when the data is skewed, some nodes may be assigned a larger amount of data than can fit into their memories. As such, pipelining cannot be effectively exploited, and performance may degenerate drastically. We propose four skew handling techniques to deal with data skew for multi-join queries. The results of a performance study show that a hybrid technique is superior in most cases
  • Keywords
    distributed databases; pipeline processing; query processing; relational databases; resource allocation; software performance evaluation; data skew; hybrid technique; iterative approach; load balancing; multi-join queries; performance degeneration; pipelined processing; relation joining; shared-nothing systems; Communication networks; Computer science; Database systems; Information systems; Iterative algorithms; Iterative methods; Load management; Pipeline processing; System performance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Systems, 1994. International Conference on
  • Conference_Location
    Hsinchu
  • Print_ISBN
    0-8186-6555-6
  • Type

    conf

  • DOI
    10.1109/ICPADS.1994.590427
  • Filename
    590427