Title :
Finding patterns in labeled graphs using spectrum feature vectors in a SOM network
Author :
Fonseca, Rigoberto ; Gómez-Gil, Pilar ; González, Jesús A. ; Olmos, Iván
Author_Institution :
Nat. Inst. of Astrophys., Opt. & Electron., Tonantzintla, Mexico
fDate :
July 31 2011-Aug. 5 2011
Abstract :
Knowledge discovery in structured databases is very important nowadays. In the last years, graph-based data mining algorithms have used artificial neural networks as tools to support clustering. Several of these algorithms have obtained promising results, but they show expensive computational costs. In this work we introduce an algorithm for clustering graphs based on a SOM network, which is part of a process for discovering useful frequent patterns in large graph databases. Our algorithm is able to handle non-directed, cyclic graphs with labels in vertices and edges. An important characteristic is that it presents polynomial computational complexity, because it uses as input a feature vector built with the spectra of the Laplacian of an adjacent matrix. Such matrix contains codes representing the labels in the graph, which preserves the semantic information included in the graphs to be grouped. We tested our algorithm in a small set of graphs and in a large structured database, finding that it creates meaningful groups of graphs.
Keywords :
computational complexity; data mining; graph theory; matrix algebra; pattern clustering; polynomials; self-organising feature maps; SOM network; adjacent matrix; artificial neural networks; feature vector; graph clustering; graph-based data mining algorithms; knowledge discovery; labeled graphs; nondirected cyclic graphs; polynomial computational complexity; self-organizing map; spectrum feature vectors; structured databases; Algorithm design and analysis; Clustering algorithms; Complexity theory; Databases; Laplace equations; Neurons; Training;
Conference_Titel :
Neural Networks (IJCNN), The 2011 International Joint Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4244-9635-8
DOI :
10.1109/IJCNN.2011.6033358