DocumentCode
2880
Title
Algebraic Optimization for Processing Graph Pattern Queries in the Cloud
Author
Anyanwu, Kemafor ; Kim, HyeongSik ; Ravindra, Padmashree
Author_Institution
North Carolina State University
Volume
17
Issue
2
fYear
2013
fDate
March-April 2013
Firstpage
52
Lastpage
61
Abstract
MapReduce platforms such as Hadoop are now the de facto standard for large-scale data processing, but they have significant limitations for join-intensive workloads typical in Semantic Web processing. This article overviews an algebraic optimization approach based on a Nested TripleGroup Data Model and Algebra (NTGA) that minimizes overall processing costs by reducing the number of MapReduce cycles. It also presents an approach for integrating NTGA-based processing of graph pattern queries into Apache Pig and compares it to execution plans using relational-style algebra operators.
Keywords
Data processing; Optimization; Query processing; Resource description framework; database management; information technology and systems; query languages; query processing;
fLanguage
English
Journal_Title
Internet Computing, IEEE
Publisher
ieee
ISSN
1089-7801
Type
jour
DOI
10.1109/MIC.2012.22
Filename
6138841
Link To Document