DocumentCode :
3228119
Title :
Sampled Bayesian Network Classifiers for Class-Imbalance and Cost-Sensitive Learning
Author :
Liangxiao Jiang ; Chaoqun Li ; Zhihua Cai ; Zhang, Haijun
Author_Institution :
Dept. of Comput. Sci., China Univ. of Geosci., Wuhan, China
fYear :
2013
fDate :
4-6 Nov. 2013
Firstpage :
512
Lastpage :
517
Abstract :
In many real-world applications, it is often the case that the class distribution of instances is imbalanced and the costs of misclassification are different. Thus, class-imbalance and cost-sensitive learning have attracted much attention from researchers. Sampling is one of the widely used approaches in dealing with the class imbalance problem, which alters the class distribution of instances so that the minority class is well represented in the training data. In this paper, we study the effect of sampling the natural training data on state-of-the-art Bayesian network classifiers, such as Naive Bayes (NB), Tree Augmented Naïve Bayes (TAN), Averaged One-Dependence Estimators (AODE), Weighted Average of One-Dependence Estimators (WAODE), and Hidden naive Bayes (HNB) and propose sampled Bayesian network classifiers. Our experimental results on a large number of UCI datasets show that our sampled Bayesian network classifiers perform much better than the ones trained from the natural training data especially when the natural training data is highly imbalanced and the cost ratio is high enough.
Keywords :
Bayes methods; pattern classification; trees (mathematics); HNB classifier; TAN classifier; UCI datasets; WAODE classifier; averaged one-dependence estimator classifier; class distribution; class imbalance problem; cost-sensitive learning; hidden naive Bayes classifier; misclassification cost; natural training data sampling; sampled Bayesian network classifiers; tree augmented naïve Bayes classifier; weighted average of one-dependence estimator classifier; Bayes methods; Breast cancer; Educational institutions; Ionosphere; Niobium; Training; Training data; Bayesian network classifiers; class-imbalance learning; cost-sensitive learning; misclassification cost; true positive rate;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence (ICTAI), 2013 IEEE 25th International Conference on
Conference_Location :
Herndon, VA
ISSN :
1082-3409
Print_ISBN :
978-1-4799-2971-9
Type :
conf
DOI :
10.1109/ICTAI.2013.82
Filename :
6735293
Link To Document :
بازگشت