DocumentCode
595335
Title
Top-k correlated subgraph query for data streams
Author
Shirui Pan ; Xingquan Zhu ; Meng Fang
Author_Institution
Centre for Quantum Comput. & Intell. Syst., Univ. of Technol. Sydney, Sydney, NSW, Australia
fYear
2012
fDate
11-15 Nov. 2012
Firstpage
2906
Lastpage
2909
Abstract
Given a query graph q, correlated subgraph query intends to find graph structures which are mostly correlated to the query q. This problem is fundamental for many pattern recognition applications involving structured data like graphs. Current available studies on correlation mining from graph data are all designed for static datasets. However, in real-life applications, data may arrive continuously in a streaming fashion with high speed. In this paper we investigate the problem of top-k correlated subgraph query over stream. By employing Hoeffding bound into the candidate discovery process and carefully maintaining a candidate list over stream, a novel algorithm, Hoe-PG, is proposed to incrementally identify the top-k correlated subgraphs in a sliding window over stream. Experiments show that the proposed method is several times more efficient than its peer with respect to the runtime and the memory consumption, and is able to maintain high precision and recall for stream-based graph query.
Keywords
data mining; graph theory; query processing; Hoe-PG; Hoeffding bound; candidate discovery process; candidate list; correlation mining; data streams; graph data; graph structures; pattern recognition applications; sliding window; static datasets; stream-based graph query; top-k correlated subgraph query; Correlation; Data mining; Databases; Estimation; Frequency estimation; Runtime;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location
Tsukuba
ISSN
1051-4651
Print_ISBN
978-1-4673-2216-4
Type
conf
Filename
6460773
Link To Document