• DocumentCode
    1042858
  • Title

    A Novel Bayes Model: Hidden Naive Bayes

  • Author

    Jiang, Liangxiao ; Zhang, Harry ; Cai, Zhihua

  • Author_Institution
    Fac. of Comput. Sci., China Univ. of Geosci., Wuhan, China
  • Volume
    21
  • Issue
    10
  • fYear
    2009
  • Firstpage
    1361
  • Lastpage
    1371
  • Abstract
    Because learning an optimal Bayesian network classifier is an NP-hard problem, learning-improved naive Bayes has attracted much attention from researchers. In this paper, we summarize the existing improved algorithms and propose a novel Bayes model: hidden naive Bayes (HNB). In HNB, a hidden parent is created for each attribute which combines the influences from all other attributes. We experimentally test HNB in terms of classification accuracy, using the 36 UCI data sets selected by Weka, and compare it to naive Bayes (NB), selective Bayesian classifiers (SBC), naive Bayes tree (NBTree), tree-augmented naive Bayes (TAN), and averaged one-dependence estimators (AODE). The experimental results show that HNB significantly outperforms NB, SBC, NBTree, TAN, and AODE. In many data mining applications, an accurate class probability estimation and ranking are also desirable. We study the class probability estimation and ranking performance, measured by conditional log likelihood (CLL) and the area under the ROC curve (AUC), respectively, of naive Bayes and its improved models, such as SBC, NBTree, TAN, and AODE, and then compare HNB to them in terms of CLL and AUC. Our experiments show that HNB also significantly outperforms all of them.
  • Keywords
    belief networks; computational complexity; data mining; learning (artificial intelligence); probability; Bayes model; NP-hard problem; ROC curve; UCI data set; averaged one-dependence estimator; class probability estimation; conditional log likelihood; data mining application; hidden naive Bayes model; naive Bayes tree; optimal Bayesian network classifier; selective Bayesian classifier; tree-augmented naive Bayes; Bayesian network classifiers; Naive Bayes; class probability estimation; classification; learning algorithms; ranking.;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2008.234
  • Filename
    4721435