• DocumentCode
    3143584
  • Title

    Adding regular expressions to graph reachability and pattern queries

  • Author

    Fan, Wenfei ; Li, Jianzhong ; Ma, Shuai ; Tang, Nan ; Wu, Yinghui

  • Author_Institution
    Univ. of Edinburgh, Edinburgh, UK
  • fYear
    2011
  • fDate
    11-16 April 2011
  • Firstpage
    39
  • Lastpage
    50
  • Abstract
    It is increasingly common to find graphs in which edges bear different types, indicating a variety of relationships. For such graphs we propose a class of reachability queries and a class of graph patterns, in which an edge is specified with a regular expression of a certain form, expressing the connectivity in a data graph via edges of various types. In addition, we define graph pattern matching based on a revised notion of graph simulation. On graphs in emerging applications such as social networks, we show that these queries are capable of finding more sensible information than their traditional counterparts. Better still, their increased expressive power does not come with extra complexity. Indeed, (1) we investigate their containment and minimization problems, and show that these fundamental problems are in quadratic time for reachability queries and are in cubic time for pattern queries. (2) We develop an algorithm for answering reachability queries, in quadratic time as for their traditional counterpart. (3) We provide two cubic-time algorithms for evaluating graph pattern queries based on extended graph simulation, as opposed to the NP-completeness of graph pattern matching via subgraph isomorphism. (4) The effectiveness, efficiency and scalability of these algorithms are experimentally verified using real-life data and synthetic data.
  • Keywords
    computational complexity; pattern matching; query processing; question answering (information retrieval); reachability analysis; NP-completeness; cubic-time algorithms; extended graph simulation; graph pattern matching; graph reachability pattern; pattern queries; reachability query answering; regular expressions; subgraph isomorphism; Biology; Cloning; Complexity theory; Image color analysis; Minimization; Pattern matching; Semantics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2011 IEEE 27th International Conference on
  • Conference_Location
    Hannover
  • ISSN
    1063-6382
  • Print_ISBN
    978-1-4244-8959-6
  • Electronic_ISBN
    1063-6382
  • Type

    conf

  • DOI
    10.1109/ICDE.2011.5767858
  • Filename
    5767858