DocumentCode :
140836
Title :
Continuous pattern detection over billion-edge graph using distributed framework
Author :
Jun Gao ; Chang Zhou ; Jiashuai Zhou ; Yu, Jeffrey Xu
Author_Institution :
Key Lab. of High Confidence Software Technol., Peking Univ., Beijing, China
fYear :
2014
fDate :
March 31 2014-April 4 2014
Firstpage :
556
Lastpage :
567
Abstract :
Continuous pattern detection plays an important role in monitoring-related applications. The large size and dynamic update of graphs, along with the massive search space, pose huge challenges in developing an efficient continuous pattern detection system. In this paper, we leverage a distributed graph processing framework to approximately detect a given pattern over a large dynamic graph. We aim to improve the scalability and precision, and reduce the response time and message cost in the detection. We convert a given query pattern into a Single-Sink DAG (Directed Acyclic Graph), and propose an evaluation plan with message transitions on the DAG, which is shorten by SSD plan, to detect the pattern in a large dynamic graph. SSD plan can guide the data graph exploration via messages, and the messages will converge at data sink vertices, which then detect existences of the query pattern. We also conduct join operations over partial vertices during the graph exploration to improve the precision of pattern detection. In addition, we show that SSD plan can support the continuous query over dynamic graphs with slight extensions. We further design various sink vertex selection strategies and neighborhood based transition rule attachment to lower the evaluation cost. The experiments on billion-edge real-life graphs using Giraph, an open source implementation of Pregel, illustrate the efficiency and effectiveness of our method.
Keywords :
directed graphs; distributed processing; query processing; Giraph; Pregel; SSD plan; billion-edge graph; continuous pattern detection system; data graph exploration; directed acyclic graph; distributed graph processing framework; large dynamic graph; massive search space; neighborhood based transition rule attachment; query pattern; single-sink DAG; sink vertex selection strategies; Biomedical monitoring; Drives; Image edge detection; Monitoring; Parallel processing; Pattern matching; Transforms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2014 IEEE 30th International Conference on
Conference_Location :
Chicago, IL
Type :
conf
DOI :
10.1109/ICDE.2014.6816681
Filename :
6816681
Link To Document :
بازگشت