Title :
FGMAC: Frequent subgraph mining with Arc Consistency
Author :
Douar, Brahim ; Liquiere, Michel ; Latiri, Chiraz ; Slimani, Yahya
Author_Institution :
LIRMM, Montpellier, France
Abstract :
With the growing need to analyze large amounts of structured data such as chemical compounds, protein structures, and XML documents, to cite but a few, graph mining has become an attractive track and a real challenge in the data mining field. Among the various kinds of graph patterns, frequent subgraphs seem to be relevant in characterizing graphsets, discriminating different groups of sets, and classifying and clustering graphs. Because of the NP-completeness of the subgraph isomorphism test as well as the huge search space, fragment miners are exponential in runtime and/or memory consumption. In this paper we study a new polynomial projection operator named AC-Projection, based on a key technique of constraint programming, namely Arc Consistency (AC). It is intended to replace the exponential subgraph isomorphism test. We study the relevance of frequent AC-reduced graph patterns for classification and we prove that an important performance gain can be achieved with no loss, or only a non-significant loss, in the quality of the discovered patterns.
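The idea of replacing subgraph isomorphism with an arc-consistency check can be illustrated with a minimal sketch. It is not the authors' FGMAC implementation: the function name `ac_projection`, the graph encoding (a label dictionary plus a set of directed edges), and the simple fixed-point loop are all assumptions made for illustration. Each pattern node starts with the set of target nodes carrying the same label, and a candidate is discarded when it cannot support some pattern edge.

```python
def ac_projection(pattern, target):
    """Hypothetical arc-consistency projection of `pattern` onto `target`.

    Each graph is a pair (labels, edges): labels maps node -> label,
    edges is a set of directed (source, destination) pairs.
    Returns the arc-consistent domains, or None if some domain empties.
    """
    p_labels, p_edges = pattern
    t_labels, t_edges = target

    # Initial domains: target nodes that carry the same label.
    domains = {
        v: {u for u, lab in t_labels.items() if lab == p_labels[v]}
        for v in p_labels
    }

    changed = True
    while changed:  # polynomial: every pass removes a candidate or stops
        changed = False
        for v, w in p_edges:
            # Keep u for v only if u has a successor among w's candidates.
            keep_v = {u for u in domains[v]
                      if any((u, x) in t_edges for x in domains[w])}
            if keep_v != domains[v]:
                domains[v] = keep_v
                changed = True
            # Keep x for w only if x has a predecessor among v's candidates.
            keep_w = {x for x in domains[w]
                      if any((u, x) in t_edges for u in domains[v])}
            if keep_w != domains[w]:
                domains[w] = keep_w
                changed = True

    return domains if all(domains.values()) else None


def directed_cycle(n, label="C"):
    """Helper (illustrative): directed cycle on nodes 0..n-1, one label."""
    return ({i: label for i in range(n)},
            {(i, (i + 1) % n) for i in range(n)})


triangle = directed_cycle(3)
hexagon = directed_cycle(6)
path = ({0: "C", 1: "C", 2: "C"}, {(0, 1), (1, 2)})

on_hexagon = ac_projection(triangle, hexagon)  # arc-consistent domains
on_path = ac_projection(triangle, path)        # None: domains empty out
```

In this sketch the directed triangle projects onto the directed 6-cycle even though the triangle is not a subgraph of it, which shows that the check is a relaxation of subgraph isomorphism: it is cheaper to compute but accepts more matches. The path graph is rejected because its end nodes cannot support both an incoming and an outgoing edge.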
Keywords :
computational complexity; data integrity; data mining; graph theory; pattern classification; polynomials; set theory; AC-Projection; FGMAC; NP-completeness; arc consistency; clustering graph; constraint programming; data mining; exponential subgraph isomorphism; fragment miners; frequent AC-reduced graph pattern; frequent subgraph mining; graph pattern classification; graphsets; memory consumption; polynomial projection operator; search space; structured data; Complexity theory; Data mining; Databases; Labeling; Memory management; Polynomials; Runtime; AC-projection; Graph classification; Graph mining;
Conference_Title :
Computational Intelligence and Data Mining (CIDM), 2011 IEEE Symposium on
Conference_Location :
Paris
Print_ISBN :
978-1-4244-9926-7
DOI :
10.1109/CIDM.2011.5949436