DocumentCode :
3426227
Title :
Faster computation of the direct product kernel for graph classification
Author :
Ketkar, Nikhil S. ; Holder, Lawrence B. ; Cook, Diane J.
fYear :
2009
fDate :
March 30 2009-April 2 2009
Firstpage :
267
Lastpage :
274
Abstract :
The direct product kernel, introduced by Gartner et al. for graph classification, is based on defining a feature for every possible label sequence in a labelled graph and counting how many label sequences in two given graphs are identical. Although the direct product kernel has achieved promising results in terms of accuracy, the kernel computation is not feasible for large graphs. This is because computing the direct product kernel for two graphs is essentially computing either the inverse of or by diagonalizing the adjacency matrix of the direct product of these two graphs. For two graphs with adjacency matrices of sizes m and n, the adjacency matrix of their direct product graph can be of size mn in the worst case. As both matrix inversion or matrix diagonalizing in the general case is O(n3), computing the direct product kernel is O((mn)3). Our survey of data sets in graph classification indicates that most graphs have adjacency matrices of sizes in the order of hundreds which often leads to adjacency matrices of direct product graphs (of two graphs) having sizes in the order of thousands. In this work we show how the direct product kernel can be computed in O((m + n)3). The key insight behind our result is that the language of label sequences in a labeled graph is a regular language and that regular languages are closed under union and intersection.
Keywords :
computational complexity; graph theory; matrix algebra; adjacency matrix; direct product kernel; graph classification; label sequence; matrix diagonalization; matrix inversion; Boosting; Kernel; Support vector machine classification; Support vector machines; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Data Mining, 2009. CIDM '09. IEEE Symposium on
Conference_Location :
Nashville, TN
Print_ISBN :
978-1-4244-2765-9
Type :
conf
DOI :
10.1109/CIDM.2009.4938659
Filename :
4938659
Link To Document :
بازگشت