• DocumentCode
    28744
  • Title

    Scaling Hop-Based Reachability Indexing for Fast Graph Pattern Query Processing

  • Author

    Ronghua Liang ; Hai Zhuge ; Xiaorui Jiang ; Qiang Zeng ; Xiaofei He

  • Author_Institution
    Coll. of Inf. Eng., Zhejiang Univ. of Technol., Hangzhou, China
  • Volume
    26
  • Issue
    11
  • fYear
    2014
  • fDate
    Nov. 2014
  • Firstpage
    2803
  • Lastpage
    2817
  • Abstract
    Graphs are becoming increasingly dominant in modeling real-life networked data including social and biological networks, the WWW and the Semantic Web, etc. Graph pattern queries are useful for gathering information with expressive semantics from these graph-structured data. Current methods for graph pattern query processing have performance deficiency caused by inefficiencies of the underlying reachability index and costly merge-join operations on huge amounts of tuple-formatted intermediate results. To overcome the above problems, this paper contributes in the following aspects to boost graph pattern query evaluation. First, we propose an improved hop-based reachability indexing scheme 3-Hop which gains faster reachability query evaluation, less indexing costs and better scalabilities than state-of-the-art hop-based methods. Second, we propose a two-stage node filtering algorithm based on 3-Hop to answer tree pattern queries more efficiently. Tree pattern queries serve as the underlying facility for graph pattern query evaluation. Furthermore, we use a graph representation of the intermediate results during node filtering and final results enumeration. Experiments on real-life and synthetic datasets demonstrate the effectiveness of the proposed methods.
  • Keywords
    graph theory; indexing; information filtering; query processing; semantic Web; WWW; biological networks; fast graph pattern query processing; graph pattern query evaluation; graph representation; graph-structured data; hop-based reachability indexing scaling; hop-based reachability indexing scheme; merge-join operations; node filtering; performance deficiency; reachability index; real-life networked data; semantic Web; social networks; tree pattern queries; tuple-formatted intermediate results; two-stage node filtering algorithm; Filtering; Indexing; Labeling; Pattern matching; Query processing; Solids; 3-Hop* labeling; Graph pattern query processing; reachability indexing; two-stage node filtering;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2014.2310207
  • Filename
    6763111