DocumentCode :
3422323
Title :
Querying XML Data using PC Cluster System
Author :
Amagasa, Toshiyuki ; Kido, Kentarou ; Kitagawa, Hiroyuki
Author_Institution :
Univ. of Tsukuba, Tsukuba
fYear :
2007
fDate :
3-7 Sept. 2007
Firstpage :
5
Lastpage :
9
Abstract :
This paper proposes a novel approach for querying large-scale XML data using a PC cluster system. With the recent spread of the XML format, large-scale data coded in XML, ranging from several hundred megabytes to several gigabytes, has become common. However, XML databases are often inefficient in dealing with huge XML data. The problem lies in the complexity of the XML data model and of query processing. To cope with this problem, we attempt to construct a parallel XML database on top of a PC cluster system. To this end, we discuss XML data partitioning to enable parallel processing of XML queries. We introduce a path-based partitioning for XML data. The obtained XML fragments are then allocated to cluster nodes. To obtain a cost-efficient allocation of the fragments, we discuss cost functions for parallel XPath processing and an algorithm to compute a pseudo-optimal allocation, which is based on the well-known genetic algorithm. Finally, we demonstrate the effectiveness of the proposed scheme through a series of experiments.
Keywords :
XML; distributed databases; genetic algorithms; parallel processing; query processing; workstation clusters; PC cluster system; XML data partitioning; XML data querying; genetic algorithm; parallel XML database; parallel XPath processing; path-based partitioning; pseudo-optimal allocation; Clustering algorithms; Concurrent computing; Cost function; Data models; Databases; Large-scale systems; Partitioning algorithms
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Database and Expert Systems Applications, 2007. DEXA '07. 18th International Workshop on
Conference_Location :
Regensburg
ISSN :
1529-4188
Print_ISBN :
978-0-7695-2932-5
Type :
conf
DOI :
10.1109/DEXA.2007.108
Filename :
4312846