DocumentCode
28744
Title
Scaling Hop-Based Reachability Indexing for Fast Graph Pattern Query Processing
Author
Ronghua Liang ; Hai Zhuge ; Xiaorui Jiang ; Qiang Zeng ; Xiaofei He
Author_Institution
Coll. of Inf. Eng., Zhejiang Univ. of Technol., Hangzhou, China
Volume
26
Issue
11
fYear
2014
fDate
Nov. 2014
Firstpage
2803
Lastpage
2817
Abstract
Graphs are becoming increasingly dominant in modeling real-life networked data including social and biological networks, the WWW and the Semantic Web, etc. Graph pattern queries are useful for gathering information with expressive semantics from these graph-structured data. Current methods for graph pattern query processing have performance deficiency caused by inefficiencies of the underlying reachability index and costly merge-join operations on huge amounts of tuple-formatted intermediate results. To overcome the above problems, this paper contributes in the following aspects to boost graph pattern query evaluation. First, we propose an improved hop-based reachability indexing scheme 3-Hop which gains faster reachability query evaluation, less indexing costs and better scalabilities than state-of-the-art hop-based methods. Second, we propose a two-stage node filtering algorithm based on 3-Hop to answer tree pattern queries more efficiently. Tree pattern queries serve as the underlying facility for graph pattern query evaluation. Furthermore, we use a graph representation of the intermediate results during node filtering and final results enumeration. Experiments on real-life and synthetic datasets demonstrate the effectiveness of the proposed methods.
Keywords
graph theory; indexing; information filtering; query processing; semantic Web; WWW; biological networks; fast graph pattern query processing; graph pattern query evaluation; graph representation; graph-structured data; hop-based reachability indexing scaling; hop-based reachability indexing scheme; merge-join operations; node filtering; performance deficiency; reachability index; real-life networked data; semantic Web; social networks; tree pattern queries; tuple-formatted intermediate results; two-stage node filtering algorithm; Filtering; Indexing; Labeling; Pattern matching; Query processing; Solids; 3-Hop* labeling; Graph pattern query processing; reachability indexing; two-stage node filtering;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2014.2310207
Filename
6763111
Link To Document