Title :
Efficient Query Processing for Large XML Data in Distributed Environments
Author :
Kurita, Hiroto ; Hatano, Kenji ; Miyazaki, Jun ; Uemura, Shunsuke
Author_Institution :
Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Ikoma
Abstract :
We propose an efficient distributed query processing method for large XML data that partitions the data and distributes the partitions to multiple computation nodes. The method involves several steps; our research focuses on XML data partitioning and on dynamic relocation of the partitioned data. Because the efficiency of query processing depends on both the size and the structure of the XML data, both factors should be considered when the data is partitioned. The partitions are distributed to computation nodes so that the CPU load is balanced. It is also important to take into account the query workload on each computation node, because it is closely related to the query processing cost in distributed environments. When load is skewed among computation nodes, partitioned XML data should be relocated to rebalance the CPU load. We therefore implemented an algorithm that relocates partitioned XML data based on the CPU load of query processing. Our experiments show a performance advantage for our approach in distributed query processing of large XML data.
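The relocation idea described in the abstract can be illustrated with a small sketch. This is not the authors' algorithm, which this record does not specify; it is a generic greedy rebalancing heuristic written under stated assumptions. The names `Node`, `relocate`, `tolerance`, and `max_moves` are hypothetical, and each partition is assumed to carry a single measured CPU cost for the queries it serves.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """A computation node holding XML partitions (hypothetical model)."""
    name: str
    # partition id -> observed CPU cost of queries on that partition
    # (arbitrary units; an assumption, not taken from the paper)
    fragments: dict = field(default_factory=dict)

    @property
    def load(self):
        return sum(self.fragments.values())


def relocate(nodes, tolerance=0.1, max_moves=100):
    """Greedily move partitions from the busiest node to the idlest node.

    Stops when the gap between the busiest and idlest node is within
    `tolerance` times the mean load, or when no single move can shrink it.
    Returns the list of moves as (partition id, source, destination).
    """
    moves = []
    for _ in range(max_moves):
        busiest = max(nodes, key=lambda n: n.load)
        idlest = min(nodes, key=lambda n: n.load)
        gap = busiest.load - idlest.load
        mean = sum(n.load for n in nodes) / len(nodes)
        if mean == 0 or gap <= tolerance * mean:
            break
        # Only a partition cheaper than the gap reduces the imbalance
        # between these two nodes when it is moved.
        candidates = [(fid, cost) for fid, cost in busiest.fragments.items()
                      if 0 < cost < gap]
        if not candidates:
            break
        # Prefer the partition whose cost is closest to half the gap,
        # since moving it leaves the two nodes as even as possible.
        fid, cost = min(candidates, key=lambda fc: abs(fc[1] - gap / 2))
        del busiest.fragments[fid]
        idlest.fragments[fid] = cost
        moves.append((fid, busiest.name, idlest.name))
    return moves
```

The sketch balances on measured CPU cost only; a real system would also weigh the cost of shipping a partition between nodes and the structure of the XML data, both of which the abstract names as relevant factors.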
Keywords :
XML; distributed processing; query processing; XML data partitioning; distributed environments; distributed query processing method; extensible markup language; query workload; Books; Costs; Distributed computing; Information science; Large-scale systems; Partitioning algorithms; Software libraries; Tree data structures
Conference_Title :
Advanced Information Networking and Applications, 2007. AINA '07. 21st International Conference on
Conference_Location :
Niagara Falls, ON
Print_ISBN :
0-7695-2846-5
DOI :
10.1109/AINA.2007.64