• DocumentCode
    71212
  • Title

    Non-Naive Bayesian Classifiers for Classification Problems With Continuous Attributes

  • Author

    Xi-Zhao Wang ; Yu-Lin He ; Wang, Da Da

  • Author_Institution
    Machine Learning Center, Hebei Univ., Baoding, China
  • Volume
    44
  • Issue
    1
  • fYear
    2014
  • fDate
    Jan. 2014
  • Firstpage
    21
  • Lastpage
    39
  • Abstract
    An important way to improve the performance of naive Bayesian classifiers (NBCs) is to remove or relax the fundamental assumption of independence among the attributes, which usually results in an estimation of joint probability density function (p.d.f.) instead of the estimation of marginal p.d.f. in the NBC design. This paper proposes a non-naive Bayesian classifier (NNBC) in which the independence assumption is removed and the marginal p.d.f. estimation is replaced by the joint p.d.f. estimation. A new technique of estimating the class-conditional p.d.f. based on the optimal bandwidth selection, which is the crucial part of the joint p.d.f. estimation, is applied in our NNBC. Three well-known indexes for measuring the performance of Bayesian classifiers, which are classification accuracy, area under receiver operating characteristic curve, and probability mean square error, are adopted to conduct a comparison among the four Bayesian models, i.e., normal naive Bayesian, flexible naive Bayesian (FNB), the homologous model of FNB (FNBROT), and our proposed NNBC. The comparative results show that NNBC is statistically superior to the other three models regarding the three indexes. And, in the comparison with support vector machine and four boosting-based classification methods, NNBC achieves a relatively favorable classification accuracy while significantly reducing the training time.
  • Keywords
    Bayes methods; estimation theory; mean square error methods; pattern classification; FNB homologous model; NNBC; area-under-receiver operating characteristic curve; class-conditional p.d.f. estimation; classification accuracy; classification problems; continuous attributes; flexible naive Bayesian; joint p.d.f. estimation; joint probability density function estimation; marginal p.d.f. estimation; non Naive Bayesian classifiers; normal naive Bayesian; performance improvement; probability mean square error; Bandwidth; Bayes methods; Equations; Estimation; Joints; Kernel; Reactive power; Joint probability density estimation; kernel function; naive Bayesian classifier (NBC); optimal bandwidth; probability mean square error;
  • fLanguage
    English
  • Journal_Title
    Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2168-2267
  • Type

    jour

  • DOI
    10.1109/TCYB.2013.2245891
  • Filename
    6471192