DocumentCode :
2771260
Title :
Efficient Discovery of Frequent Correlated Subgraph Pairs
Author :
Ke, Yiping ; Cheng, James ; Yu, Jeffrey Xu
Author_Institution :
Chinese Univ. of Hong Kong, Hong Kong, China
fYear :
2009
fDate :
6-9 Dec. 2009
Firstpage :
239
Lastpage :
248
Abstract :
The recent proliferation of graph data in a wide spectrum of applications has led to an increasing demand for advanced data analysis techniques. In view of this, many graph mining techniques, such as frequent subgraph mining and correlated subgraph mining, have been proposed. In many applications, both frequency and correlation play an important role. Thus, this paper studies a new problem of mining the set of frequent correlated subgraph pairs. A simple algorithm that combines existing algorithms for mining frequent subgraphs and correlated subgraphs results in a multiplication of the mining operations, the majority of which are redundant. We discover that most of the graphs correlated to a common graph are also highly correlated. We establish theoretical foundations for this finding and derive a tight lower bound on the correlation of any two graphs that are correlated to a common graph. This theoretical result leads to the design of a very effective skipping mechanism, by which we skip the processing of a majority of graphs in the mining process. Our algorithm, FCP-Miner, is a fast approximate algorithm, but we show that the missing pairs are only a small set of marginally correlated pairs. Extensive experiments verify both the efficiency and effectiveness of FCP-Miner.
Keywords :
data analysis; data mining; graph theory; FCP-Miner; advanced data analysis techniques; correlated subgraph mining; frequent correlated subgraph pair discovery; frequent subgraph mining; graph data; skipping mechanism; Biochemical analysis; Bioinformatics; Chemistry; Data analysis; Data mining; Databases; Drugs; Frequency; Pattern recognition; Social network services; Pearson's correlation coefficient; frequent correlated subgraph pairs; graph mining
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
Conference_Location :
Miami, FL
ISSN :
1550-4786
Print_ISBN :
978-1-4244-5242-2
Electronic_ISBN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2009.54
Filename :
5360249