• DocumentCode
    3274879
  • Title

    A comparative study among different kernel functions in flexible naïve Bayesian classification

  • Author

    Liu, James N K ; He, Yu-Lin ; Wang, Xi-Zhao ; Hu, Yan-xing

  • Author_Institution
    Dept. of Comput., Hong Kong Polytech. Univ., Kowloon, China
  • Volume
    2
  • fYear
    2011
  • fDate
    10-13 July 2011
  • Firstpage
    638
  • Lastpage
    643
  • Abstract
    When determining the class of the unknown example by using naïve Bayesian classifier, we need to estimate the class conditional probabilities for the continuous attributes. In flexible Bayesian classifier, the Gaussian kernel function is frequently used for classification task under the framework of Parzen window method. In this paper, the other six kernel functions (uniform, triangular, epanechnikov, biweight, triweight and cosine) are introduced in the flexible naïve Bayesian. The performances of these seven kernels are compared in 30 UCI datasets. The experimental comparisons are carried out according to the following three aspects: the classification accuracy, ranking performance and the class probability estimation. The latter two are measured by the area under the ROC curve (AUC) and the conditional log likelihood (CLL). The related kernels are compared via two-tailed t-test with a 95 percent confidence level and the Friedman´s test using the 0.05 critical level. The experimental results show that the most commonly used Gaussian kernel can not achieve the best classification accuracy and AUC. However, on the CLL, the Gaussian kernel is statistically significantly better than the other six kernels. Finally, the corresponding analyses are given based on the experimental results.
  • Keywords
    Bayes methods; Gaussian processes; pattern classification; statistical distributions; Friedman test; Gaussian kernel function; Parzen window method; ROC curve; UCI datasets; class conditional probabilities; conditional log likelihood; continuous attributes; flexible naive Bayesian classification; kernel functions; probability estimation; Accuracy; Bayesian methods; Breast cancer; Estimation; Heart; Kernel; Machine learning; AUC; CLL; Gaussian; Naïve Bayesian classifier; biweight; cosine; density estimation; epanechnikov; triangular; triweight; uniform;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics (ICMLC), 2011 International Conference on
  • Conference_Location
    Guilin
  • ISSN
    2160-133X
  • Print_ISBN
    978-1-4577-0305-8
  • Type

    conf

  • DOI
    10.1109/ICMLC.2011.6016813
  • Filename
    6016813