DocumentCode :
3250402
Title :
Computing frequent graph patterns from semistructured data
Author :
Vanetik, N. ; Gudes, E. ; Shimony, S.E.
Author_Institution :
Dept. of Comput. Sci., Ben-Gurion Univ. of the Negev, Beer-Sheva, Israel
fYear :
2002
fDate :
2002
Firstpage :
458
Lastpage :
465
Abstract :
Whereas data mining in structured data focuses on frequent data values, in semistructured and graph data the emphasis is on frequent labels and common topologies. Here, the structure of the data is just as important as its content. We study the problem of discovering typical patterns of graph data. The discovered patterns can be useful for many applications, including: compact representation of source information and a road-map for browsing and querying information sources. Difficulties arise in the discovery task from the complexity of some of the required sub-tasks, such as sub-graph isomorphism. This paper proposes a new algorithm for mining graph data, based on a novel definition of support. Empirical evidence shows practical, as well as theoretical, advantages of our approach.
Keywords :
data mining; graphs; common topologies; compact source information representation; complexity; data mining; frequent graph pattern computation; frequent labels; graph data; information source browsing; information source querying; pattern discovery; semistructured data; sub-graph isomorphism; Association rules; Computer science; Data mining; Databases; Frequency; Indexing; Topology; Tree graphs; XML;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7695-1754-4
Type :
conf
DOI :
10.1109/ICDM.2002.1183988
Filename :
1183988
Link To Document :
بازگشت