DocumentCode
3274879
Title
A comparative study among different kernel functions in flexible naïve Bayesian classification
Author
Liu, James N K ; He, Yu-Lin ; Wang, Xi-Zhao ; Hu, Yan-xing
Author_Institution
Dept. of Comput., Hong Kong Polytech. Univ., Kowloon, China
Volume
2
fYear
2011
fDate
10-13 July 2011
Firstpage
638
Lastpage
643
Abstract
When determining the class of the unknown example by using naïve Bayesian classifier, we need to estimate the class conditional probabilities for the continuous attributes. In flexible Bayesian classifier, the Gaussian kernel function is frequently used for classification task under the framework of Parzen window method. In this paper, the other six kernel functions (uniform, triangular, epanechnikov, biweight, triweight and cosine) are introduced in the flexible naïve Bayesian. The performances of these seven kernels are compared in 30 UCI datasets. The experimental comparisons are carried out according to the following three aspects: the classification accuracy, ranking performance and the class probability estimation. The latter two are measured by the area under the ROC curve (AUC) and the conditional log likelihood (CLL). The related kernels are compared via two-tailed t-test with a 95 percent confidence level and the Friedman´s test using the 0.05 critical level. The experimental results show that the most commonly used Gaussian kernel can not achieve the best classification accuracy and AUC. However, on the CLL, the Gaussian kernel is statistically significantly better than the other six kernels. Finally, the corresponding analyses are given based on the experimental results.
Keywords
Bayes methods; Gaussian processes; pattern classification; statistical distributions; Friedman test; Gaussian kernel function; Parzen window method; ROC curve; UCI datasets; class conditional probabilities; conditional log likelihood; continuous attributes; flexible naive Bayesian classification; kernel functions; probability estimation; Accuracy; Bayesian methods; Breast cancer; Estimation; Heart; Kernel; Machine learning; AUC; CLL; Gaussian; Naïve Bayesian classifier; biweight; cosine; density estimation; epanechnikov; triangular; triweight; uniform;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics (ICMLC), 2011 International Conference on
Conference_Location
Guilin
ISSN
2160-133X
Print_ISBN
978-1-4577-0305-8
Type
conf
DOI
10.1109/ICMLC.2011.6016813
Filename
6016813
Link To Document