DocumentCode
48413
Title
Bag Constrained Structure Pattern Mining for Multi-Graph Classification
Author
Jia Wu ; Xingquan Zhu ; Chengqi Zhang ; Yu, Philip S.
Author_Institution
Centre for Quantum Comput. & Intell. Syst., Univ. of Technol. Sydney, Sydney, NSW, Australia
Volume
26
Issue
10
fYear
2014
fDate
Oct. 2014
Firstpage
2382
Lastpage
2396
Abstract
This paper formulates a multi-graph learning task. In our problem setting, a bag contains a number of graphs and a class label. A bag is labeled positive if at least one graph in the bag is positive, and negative otherwise. In addition, the genuine label of each graph in a positive bag is unknown, and all graphs in a negative bag are negative. The aim of multi-graph learning is to build a learning model from a number of labeled training bags to predict previously unseen test bags with maximum accuracy. This problem setting is essentially different from existing multi-instance learning (MIL), where instances in MIL share well-defined feature values, but no features are available to represent graphs in a multi-graph bag. To solve the problem, we propose a Multi-Graph Feature based Learning (gMGFL) algorithm that explores and selects a set of discriminative subgraphs as features to transfer each bag into a single instance, with the bag label being propagated to the transferred instance. As a result, the multi-graph bags form a labeled training instance set, so generic learning algorithms, such as decision trees, can be used to derive learning models for multi-graph classification. Experiments and comparisons on real-world multi-graph tasks demonstrate the algorithm performance.
Keywords
data mining; graph theory; learning (artificial intelligence); pattern classification; MIL; bag constrained structure pattern mining; bag label; class label; decision trees; discriminative subgraphs; gMGFL algorithm; labeled training bags; labeled training instance set; multigraph bag; multigraph classification; multigraph feature based learning algorithm; multigraph learning model; multigraph learning task; multiinstance learning; Bismuth; Educational institutions; Electronic mail; Laplace equations; Supervised learning; Training; Vectors; Graph classification; multi-graph, subgraph features; multi-instance learning;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2013.2297923
Filename
6702420
Link To Document