Title :
AdaBoost algorithm with random forests for predicting breast cancer survivability
Author :
Thongkam, Jaree ; Xu, Guandong ; Zhang, Yanchun
Author_Institution :
Sch. of Comput. Sci. & Math., Victoria Univ., Melbourne, VIC
Abstract :
In this paper we propose a combination of the AdaBoost and random forests algorithms for constructing a breast cancer survivability prediction model. We use random forests as a weak learner of AdaBoost for selecting the high weight instances during the boosting process to improve accuracy, stability and to reduce overfitting problems. The capability of this hybrid method is evaluated using basic performance measurements (e.g., accuracy, sensitivity, and specificity), Receiver Operating Characteristic (ROC) curve and Area Under the receiver operating characteristic Curve (AUC). Experimental results indicate that the proposed method outperforms a single classifier and other combined classifiers for the breast cancer survivability prediction.
Keywords :
cancer; learning (artificial intelligence); medical computing; sensitivity analysis; stability; AdaBoost algorithm; breast cancer survivability prediction model; overfitting problems; random forests; receiver operating characteristic curve; stability; Breast cancer; Computer science; Data mining; Decision trees; Diseases; Mathematics; Medical diagnostic imaging; Predictive models; Signal processing algorithms; Support vector machines;
Conference_Titel :
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-1820-6
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2008.4634231