Title :
Parallel processing of “GroupBy-Before-Join” queries in cluster architecture
Author :
Taniar, David ; Rahayu, J. Wenny
Author_Institution :
Sch. of Bus. Syst., Monash Univ., Clayton, Vic., Australia
Abstract :
SQL queries in the real world are replete with group-by and join operations. This type of queries is often known as “GroupBy-Join” queries. In some GroupBy-Join queries, it is desirable to perform group-by before join in order to achieve better performance. This subset of GroupBy-Join queries is called “GroupBy-Before-Join” queries. In this paper, we present a study on the parallelization of GroupBy-Before-Join queries, particularly by exploiting cluster architectures. From our study, we have learned that, in parallel query optimization, processing group-by operations as early as possible is not always desirable. On many occasions, performing data distribution first, before group-by, offers performance advantages. In this study, we also describe our cluster-based scheme
Keywords :
parallel architectures; parallel databases; query processing; software performance evaluation; workstation clusters; GroupBy-Before-Join queries; GroupBy-Join queries; SQL queries; cluster architecture; data distribution; group-by operations; join operations; parallel processing; parallel query optimization; parallelization; performance; Aggregates; Australia; Computer architecture; Data warehouses; Databases; Decision making; Parallel architectures; Parallel processing; Performance gain; Query processing;
Conference_Titel :
Cluster Computing and the Grid, 2001. Proceedings. First IEEE/ACM International Symposium on
Conference_Location :
Brisbane, Qld.
Print_ISBN :
0-7695-1010-8
DOI :
10.1109/CCGRID.2001.923191