Title of article :
Survival Prediction and Feature Selection in Patients with Breast Cancer Using Support Vector Regression
Author/Authors :
Goli, Shahrbanoo Department of Biostatistics and Epidemiology - School of Public Health - Hamadan University of Medical Sciences - Hamadan, Iran , Mahjub, Hossein Department of Biostatistics and Epidemiology - School of Public Health - Hamadan University of Medical Sciences - Hamadan, Iran , Faradmal, Javad Department of Biostatistics and Epidemiology - School of Public Health - Hamadan University of Medical Sciences - Hamadan, Iran , Mashayekhi, Hoda Computer and IT Engineering Department - Shahrood University - Shahrood, Iran , Soltanian, Ali-Reza Department of Biostatistics and Epidemiology - School of Public Health - Hamadan University of Medical Sciences - Hamadan, Iran
Abstract :
The Support Vector Regression (SVR) model has been broadly used for response prediction. However, few researchers have used
SVR for survival analysis. In this study, a new SVR model is proposed and SVR with different kernels and the traditional Cox model
are trained. The models are compared based on different performance measures. We also select the best subset of features using
three feature selection methods: combination of SVR and statistical tests, univariate feature selection based on concordance index,
and recursive feature elimination. The evaluations are performed using available medical datasets and also a Breast Cancer (BC)
dataset consisting of 573 patients who visited the Oncology Clinic of Hamadan province in Iran. Results show that, for the BC
dataset, survival time can be predicted more accurately by linear SVR than nonlinear SVR. Based on the three feature selection
methods, metastasis status, progesterone receptor status, and human epidermal growth factor receptor 2 status are the best features
associated to survival. Also, according to the obtained results, performance of linear and nonlinear kernels is comparable. The
proposed SVR model performs similar to or slightly better than other models. Also, SVR performs similar to or better than Cox
when all features are included in model.
Keywords :
Survival , SVR , Breast Cancer Using Support Vector Regression
Journal title :
Computational and Mathematical Methods in Medicine