Title :
Prominent label identification and multi-label classification for cancer prognosis prediction
Author :
Saleema, J.S. ; Sairam, B. ; Naveen, S.D. ; Yuvaraj, K. ; Patnaik, L.M.
Author_Institution :
Dept. of Comp.Sc., Christ Univ., Bangalore, India
Abstract :
Cancer prognosis prediction improves the quality of treatment and increases the survivability of the patients. Conventional methods of cancer prediction deal with single class by limiting the prognosis prediction to one response variable. The SEER Public Use cancer database has more prominent variables that support better prediction approach. The objective of this paper is to find the prominent labels from cancer databases and use them in a multi-class environment. The implementation consist of three phases namely, pre-processing, prominent label identification and multi-label classification. Breast, Colorectal and Respiratory Cancer Data sets have been used for the experimentation. Also random samples from all three data sets are generated to form a mixed cancer data. Patient survival, number of primaries and age at diagnosis are the prominent labels identified from others using the Decision tree, Naïve Bayes and KNN algorithms. The three prominent labels have been tested using multi-label RAkEL algorithm to find the relations between them. The results of the empirical study are comparatively better than the traditional way of cancer prediction.
Keywords :
Bayes methods; biological organs; cancer; decision trees; medical diagnostic computing; patient diagnosis; patient treatment; pattern classification; KNN algorithms; Naive Bayes algorithm; SEER public use cancer database; breast cancer data sets; cancer prognosis prediction; colorectal cancer data sets; decision tree algorithms; multilabel RAkEL algorithm; multilabel classification; patient diagnosis; patient survivability; patient treatment; prominent label identification; respiratory cancer data sets; Accuracy; Breast; Cancer; Classification algorithms; Data mining; Decision trees; Prediction algorithms; Classifier; Multi-label; Response Variable; SEER;
Conference_Titel :
TENCON 2012 - 2012 IEEE Region 10 Conference
Conference_Location :
Cebu
Print_ISBN :
978-1-4673-4823-2
Electronic_ISBN :
2159-3442
DOI :
10.1109/TENCON.2012.6412321