DocumentCode :
140751
Title :
Query optimization of distributed pattern matching
Author :
Jiewen Huang ; Venkatraman, K. ; Abadi, Daniel J.
Author_Institution :
Yale Univ., New Haven, CT, USA
fYear :
2014
fDate :
March 31 2014-April 4 2014
Firstpage :
64
Lastpage :
75
Abstract :
Greedy algorithms for subgraph pattern matching operations are often sufficient when the graph data set can be held in memory on a single machine. However, as graph data sets increasingly expand and require external storage and partitioning across a cluster of machines, more sophisticated query optimization techniques become critical to avoid explosions in query latency. In this paper, we introduce several query optimization techniques for distributed graph pattern matching. These techniques include (1) a System-R style dynamic programming-based optimization algorithm that considers both linear and bushy plans, (2) a cycle detection-based algorithm that leverages cycles to reduce intermediate result set sizes, and (3) a computation reusing technique that eliminates redundant query execution and data transfer over the network. Experimental results show that these algorithms can lead to an order of magnitude improvement in query performance.
Keywords :
distributed processing; dynamic programming; pattern matching; query processing; cycle detection based algorithm; distributed graph pattern matching; dynamic programming; external storage; greedy algorithms; query latency; query optimization techniques; subgraph pattern matching operations; Data models; Distributed databases; Heuristic algorithms; Optimization; Partitioning algorithms; Pattern matching; Query processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2014 IEEE 30th International Conference on
Conference_Location :
Chicago, IL
Type :
conf
DOI :
10.1109/ICDE.2014.6816640
Filename :
6816640
Link To Document :
بازگشت