DocumentCode :
243605
Title :
Quick Mining of Isomorphic Exact Large Patterns from Large Graphs
Author :
Almasri, Islam ; Xin Gao ; Fedoroff, Nina
Author_Institution :
Comput., Electr. & Math. Sci., King Abdullah Univ. of Sci. & Technol. (KAUST), Thuwal, Saudi Arabia
fYear :
2014
fDate :
14-14 Dec. 2014
Firstpage :
517
Lastpage :
524
Abstract :
The applications of the sub graph isomorphism search are growing with the growing number of areas that model their systems using graphs or networks. Specifically, many biological systems, such as protein interaction networks, molecular structures and protein contact maps, are modeled as graphs. The sub graph isomorphism search is concerned with finding all sub graphs that are isomorphic to a relevant query graph, the existence of such sub graphs can reflect on the characteristics of the modeled system. The most computationally expensive step in the search for isomorphic sub graphs is the backtracking algorithm that traverses the nodes of the target graph. In this paper, we propose a pruning approach that is inspired by the minimum remaining value heuristic that achieves greater scalability over large query and target graphs. Our testing on various biological networks shows that performance enhancement of our approach over existing state-of-the-art approaches varies between 6x and 53x.
Keywords :
backtracking; biology computing; data mining; graph theory; search problems; backtracking algorithm; biological networks; biological systems; isomorphic exact large pattern; large graph; minimum remaining value heuristic; pruning approach; query graph; quick mining; subgraph isomorphism search; Biological system modeling; Filtering; Heuristic algorithms; Image color analysis; Proteins; Scalability; Graphs; Isomorphism; Mining Networks; Subgraphs;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4275-6
Type :
conf
DOI :
10.1109/ICDMW.2014.65
Filename :
7022640
Link To Document :
بازگشت