• DocumentCode
    961102
  • Title

    Nonlinear Support Vector Machine Visualization for Risk Factor Analysis Using Nomograms and Localized Radial Basis Function Kernels

  • Author

    Cho, Baek Hwan ; Yu, Hwanjo ; Lee, Jongshill ; Chee, Young Joon ; Kim, In Young ; Kim, Sun I.

  • Author_Institution
    Hanyang Univ., Seoul
  • Volume
    12
  • Issue
    2
  • fYear
    2008
  • fDate
    3/1/2008 12:00:00 AM
  • Firstpage
    247
  • Lastpage
    256
  • Abstract
    Nonlinear classifiers, e.g., support vector machines (SVMs) with radial basis function (RBF) kernels, have been used widely for automatic diagnosis of diseases because of their high accuracies. However, it is difficult to visualize the classifiers, and thus difficult to provide intuitive interpretation of results to physicians. We developed a new nonlinear kernel, the localized radial basis function (LRBF) kernel, and new visualization system visualization for risk factor analysis (VRIFA) that applies a nomogram and LRBF kernel to visualize the results of nonlinear SVMs and improve the interpretability of results while maintaining high prediction accuracy. Three representative medical datasets from the University of California, Irvine repository and Statlog dataset-breast cancer, diabetes, and heart disease datasets-were used to evaluate the system. The results showed that the classification performance of the LRBF is comparable with that of the RBF, and the LRBF is easy to visualize via a nomogram. Our study also showed that the LRBF kernel is less sensitive to noise features than the RBF kernel, whereas the LRBF kernel degrades the prediction accuracy more when important features are eliminated. We demonstrated the VRIFA system, which visualizes the results of linear and nonlinear SVMs with LRBF kernels, on the three datasets.
  • Keywords
    cardiology; data visualisation; diseases; mammography; medical diagnostic computing; nomograms; pattern classification; radial basis function networks; support vector machines; Irvine repository and Statlog dataset; RBF kernels; SVM; University of California; automatic disease diagnosis; breast cancer; classification performance; diabetes; feature selection methods; heart disease datasets; localized radial basis function kernels; nomograms; nonlinear support vector machine visualization system; risk factor analysis; Decision support systems; Feature selection; Localized Radial Basis Function kernel; Nomograms; Support vector machines; Visualization; feature selection; localized radial basis function (LRBF) kernel; nomograms; support vector machines (SVMs); visualization; Artificial Intelligence; Computer Graphics; Diagnosis, Computer-Assisted; Nonlinear Dynamics; Pattern Recognition, Automated; Prognosis; Proportional Hazards Models; Risk Assessment; Risk Factors; User-Computer Interface;
  • fLanguage
    English
  • Journal_Title
    Information Technology in Biomedicine, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1089-7771
  • Type

    jour

  • DOI
    10.1109/TITB.2007.902300
  • Filename
    4374086