DocumentCode
1478505
Title
Continuous Subgraph Pattern Search over Certain and Uncertain Graph Streams
Author
Chen, Lei ; Wang, Changliang
Author_Institution
Dept. of Comput. Sci. & Eng., Hong Kong Univ. of Sci. & Technol., Hong Kong, China
Volume
22
Issue
8
fYear
2010
Firstpage
1093
Lastpage
1109
Abstract
Search over graph databases has attracted much attention recently due to its usefulness in many fields, such as the analysis of chemical compounds, intrusion detection in network traffic data, and pattern matching over users´ visiting logs. However, most of the existing works focus on search over static graph databases, while in many real applications, graphs are changing over time. In this paper, we investigate a new problem on continuous subgraph pattern search under the situation where multiple target graphs are constantly changing in a stream style, namely, the subgraph pattern search over graph streams. Obviously, the proposed problem is a continuous join between query patterns and graph streams where the join predicate is the existence of subgraph isomorphism. Due to the NP-completeness of subgraph isomorphism checking, to achieve the real-time monitoring of the existence of certain subgraph patterns, we would like to avoid using subgraph isomorphism verification to find the exact query-stream subgraph isomorphic pairs but to offer an approximate answer that could report all probable pairs without missing any actual answer pairs. Therefore, we propose a lightweight yet effective feature structure called Node-Neighbor Tree to filter out false candidate query-stream pairs. To reduce the computational cost, we propose a novel idea, projecting the feature structures into a numerical vector space and conducting dominant relationship checking in the projected space. We design two methods to efficiently verify dominant relationships, and thus, answer the subgraph search over graph streams efficiently. In addition to answering queries over certain graph streams, we propose a novel problem, detecting the appearance of subgraph patterns over uncertain graph streams with high probability (i.e., larger than the probability threshold specified by users). To address this problem, we not only extend the proposed solutions for certain graphs streams, but also propose a new p- - runing technique by utilizing the probability threshold. We substantiate our methods with extensive experiments on both certain and uncertain graph streams.
Keywords
database management systems; formal verification; query processing; tree data structures; NP-completeness; candidate query-stream pairs; certain graph streams; continuous subgraph pattern search; dominant relationship checking; graph databases; node-neighbor tree; pruning technique; query patterns; subgraph isomorphism; subgraph isomorphism checking; subgraph isomorphism verification; uncertain graph streams; Subgraph search; graph streams; node-neighbor tree; uncertain graph streams.;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2010.67
Filename
5453378
Link To Document