DocumentCode
1042858
Title
A Novel Bayes Model: Hidden Naive Bayes
Author
Jiang, Liangxiao ; Zhang, Harry ; Cai, Zhihua
Author_Institution
Fac. of Comput. Sci., China Univ. of Geosci., Wuhan, China
Volume
21
Issue
10
fYear
2009
Firstpage
1361
Lastpage
1371
Abstract
Because learning an optimal Bayesian network classifier is an NP-hard problem, learning-improved naive Bayes has attracted much attention from researchers. In this paper, we summarize the existing improved algorithms and propose a novel Bayes model: hidden naive Bayes (HNB). In HNB, a hidden parent is created for each attribute which combines the influences from all other attributes. We experimentally test HNB in terms of classification accuracy, using the 36 UCI data sets selected by Weka, and compare it to naive Bayes (NB), selective Bayesian classifiers (SBC), naive Bayes tree (NBTree), tree-augmented naive Bayes (TAN), and averaged one-dependence estimators (AODE). The experimental results show that HNB significantly outperforms NB, SBC, NBTree, TAN, and AODE. In many data mining applications, an accurate class probability estimation and ranking are also desirable. We study the class probability estimation and ranking performance, measured by conditional log likelihood (CLL) and the area under the ROC curve (AUC), respectively, of naive Bayes and its improved models, such as SBC, NBTree, TAN, and AODE, and then compare HNB to them in terms of CLL and AUC. Our experiments show that HNB also significantly outperforms all of them.
Keywords
belief networks; computational complexity; data mining; learning (artificial intelligence); probability; Bayes model; NP-hard problem; ROC curve; UCI data set; averaged one-dependence estimator; class probability estimation; conditional log likelihood; data mining application; hidden naive Bayes model; naive Bayes tree; optimal Bayesian network classifier; selective Bayesian classifier; tree-augmented naive Bayes; Bayesian network classifiers; Naive Bayes; class probability estimation; classification; learning algorithms; ranking.;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2008.234
Filename
4721435
Link To Document