DocumentCode
71212
Title
Non-Naive Bayesian Classifiers for Classification Problems With Continuous Attributes
Author
Xi-Zhao Wang ; Yu-Lin He ; Wang, Da Da
Author_Institution
Machine Learning Center, Hebei Univ., Baoding, China
Volume
44
Issue
1
fYear
2014
fDate
Jan. 2014
Firstpage
21
Lastpage
39
Abstract
An important way to improve the performance of naive Bayesian classifiers (NBCs) is to remove or relax the fundamental assumption of independence among the attributes, which usually results in an estimation of joint probability density function (p.d.f.) instead of the estimation of marginal p.d.f. in the NBC design. This paper proposes a non-naive Bayesian classifier (NNBC) in which the independence assumption is removed and the marginal p.d.f. estimation is replaced by the joint p.d.f. estimation. A new technique of estimating the class-conditional p.d.f. based on the optimal bandwidth selection, which is the crucial part of the joint p.d.f. estimation, is applied in our NNBC. Three well-known indexes for measuring the performance of Bayesian classifiers, which are classification accuracy, area under receiver operating characteristic curve, and probability mean square error, are adopted to conduct a comparison among the four Bayesian models, i.e., normal naive Bayesian, flexible naive Bayesian (FNB), the homologous model of FNB (FNBROT), and our proposed NNBC. The comparative results show that NNBC is statistically superior to the other three models regarding the three indexes. And, in the comparison with support vector machine and four boosting-based classification methods, NNBC achieves a relatively favorable classification accuracy while significantly reducing the training time.
Keywords
Bayes methods; estimation theory; mean square error methods; pattern classification; FNB homologous model; NNBC; area-under-receiver operating characteristic curve; class-conditional p.d.f. estimation; classification accuracy; classification problems; continuous attributes; flexible naive Bayesian; joint p.d.f. estimation; joint probability density function estimation; marginal p.d.f. estimation; non Naive Bayesian classifiers; normal naive Bayesian; performance improvement; probability mean square error; Bandwidth; Bayes methods; Equations; Estimation; Joints; Kernel; Reactive power; Joint probability density estimation; kernel function; naive Bayesian classifier (NBC); optimal bandwidth; probability mean square error;
fLanguage
English
Journal_Title
Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
2168-2267
Type
jour
DOI
10.1109/TCYB.2013.2245891
Filename
6471192
Link To Document