Title :
Selection of nodes for distributing relations in parallel database
Author_Institution :
Sch. of Comput. Sci. & Technol., Heilongjiang Univ., Harbin, China
Abstract :
In a parallel database system, good data placement can greatly improve the execution efficiency of multi-join queries. In parallel database systems built on PC clusters, network communication bandwidth is always the bottleneck, and data communication among nodes adds time cost when join operations are executed. This paper proposes a node selection algorithm that takes data redistribution into consideration and reduces the additional communication cost. Furthermore, it takes into account intra-operator parallelism, independent inter-operator parallelism, and pipelined parallelism in order to exploit the parallelism of PC cluster systems. Experimental results indicate that the algorithm performs well and helps improve the execution efficiency of parallel multi-join queries.
Keywords :
parallel databases; pipeline processing; data communication; distributing relations; interoperator parallelism; network communication bandwidth; parallel database; pipelined parallelism; Bandwidth; Clustering algorithms; Costs; Database systems; Multidimensional systems; Parallel processing; Pipelines; Relational databases; Robotics and automation; PC clusters; inter-operator parallelism; multi-join query
Conference_Title :
Informatics in Control, Automation and Robotics (CAR), 2010 2nd International Asia Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-5192-0
Electronic_ISBN :
1948-3414
DOI :
10.1109/CAR.2010.5456880