DocumentCode :
3518815
Title :
Graph classification with imbalanced data sets
Author :
Xiao, Gang-Song ; Chen, Xiao-Yun
Author_Institution :
Coll. of Math. & Comput. Sci., Fuzhou Univ., Fuzhou, China
fYear :
2011
fDate :
28-28 Nov. 2011
Firstpage :
57
Lastpage :
61
Abstract :
Many graph classification methods have been proposed in recent years. These graph classification methods can perform well with balanced graph data sets, but perform poorly with imbalanced graph data sets. In this paper, we propose a new graph classification method based on cost sensitivity to deal with imbalance. First, we introduce a misclassification cost-matrix, and select the weighted subgraph based on the least misclassification cost as the attribute of graph. Then we build up a decision stump classifier and ensemble learning, finally obtain classify critical function to classify a new graph. Especially we prove that the supergraph of a weighted subgraph has an upper bound. And we can use the upper bound of supergraph to reduce the number of candidate subgraphs, so our method can be very efficient. Moreover we compare our method with other graph classification methods through experiment on imbalanced graph date sets.
Keywords :
data mining; graph theory; learning (artificial intelligence); pattern classification; balanced graph data sets; decision stump classifier; ensemble learning; graph classification methods; imbalanced graph data sets; misclassification cost matrix; weighted subgraph; Accuracy; Algorithm design and analysis; Boosting; Classification algorithms; Kernel; Training; Upper bound; Class imbalance; Cost-sensitive learning; Graph classification; Graph mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ACPR), 2011 First Asian Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4577-0122-1
Type :
conf
DOI :
10.1109/ACPR.2011.6166613
Filename :
6166613
Link To Document :
بازگشت