Title :
Naive Bayes classification based on negative-case-pruning
Author :
Qin, Feng ; REN, Shiliu ; CHENG, Zekai ; LUO, Hui
Author_Institution :
Sch. of Comput. Sci., Anhui Univ. of Technol., Maanshan
Abstract :
The traditional data mining algorithms behaves undesirable in the instance of imbalanced data sets, as the distribution of the data sets is not taken into consideration when these algorithms are designed. This paper describes the Naive Bayes classification mechanism, and then points out that, random re-sampling methods not to be able to improve its performance. A negative-case-pruning Naive Bayes (NCP_NB) algorithm is proposed to reduce the negative effects of the class imbalance; Experiments with UCI datasets show the validity of NCP_NB.
Keywords :
Bayes methods; data mining; pattern classification; sampling methods; Naive Bayes classification; data mining algorithms; imbalanced data sets; negative-case-pruning; re-sampling methods; Algorithm design and analysis; Computer science; Data mining; Design automation; Diabetes; Glass; Intelligent control; Iris; Vehicles; Voting; AUC; Imbalanced data; Naive Bayes; posterior probability; re-sampling;
Conference_Titel :
Intelligent Control and Automation, 2008. WCICA 2008. 7th World Congress on
Conference_Location :
Chongqing
Print_ISBN :
978-1-4244-2113-8
Electronic_ISBN :
978-1-4244-2114-5
DOI :
10.1109/WCICA.2008.4593866