Title :
Discovering frequent geometric subgraphs
Author :
Kuramochi, Michihiro ; Karypis, George
Author_Institution :
Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA
Abstract :
As data mining techniques are being increasingly applied to non-traditional domains, existing approaches for finding frequent itemsets cannot be used as they cannot model the requirement of these domains. An alternate way of modeling the objects in these data sets, is to use a graph to model the database objects. Within that model, the problem of finding frequent patterns becomes that of discovering subgraphs that occur frequently over the entire set of graphs. We present a computationally efficient algorithm for finding frequent geometric subgraphs in a large collection of geometric graphs. Our algorithm is able to discover geometric subgraphs that can be rotation, scaling and translation invariant, and it can accommodate inherent errors on the coordinates of the vertices. Our experimental results show that our algorithms require relatively little time, can accommodate low support values, and scale linearly on the number of transactions.
Keywords :
data mining; graph theory; pattern recognition; very large databases; computationally efficient algorithm; data mining; data sets; database objects; experimental results; frequent geometric subgraph discovery; frequent itemsets; frequent pattern discovery; transactions; very large databases; Chemical compounds; Computer science; Contracts; Data mining; High performance computing; Itemsets; Military computing; Spatial databases; Transaction databases; US Department of Energy;
Conference_Titel :
Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7695-1754-4
DOI :
10.1109/ICDM.2002.1183911