• DocumentCode
    3353400
  • Title

    Predicting breast cancer survivability using random forest and multivariate adaptive regression splines

  • Author

    Dengju Yao ; Jing Yang ; Xiaojuan Zhan

  • Author_Institution
    Coll. of Comput. Sci. & Technol., Harbin Eng. Univ., Harbin, China
  • Volume
    4
  • fYear
    2011
  • fDate
    12-14 Aug. 2011
  • Firstpage
    2204
  • Lastpage
    2207
  • Abstract
    In this paper, we propose a hybrid of random forest and multivariate adaptive regression splines algorithms for building a breast cancer survivability prediction model. We use random forest to perform a preliminary screening of variables and to receive a importance ranks. Then, the new dataset is extracted from initial WDBC dataset according to top-k important predictors and is input into the MARS procedure, which is responsible for building interpretable models for predicting breast cancer survivability. The capability of this combination method is evaluated using basic performance measurements (e.g., accuracy, sensitivity, and specificity) along with a 10-fold cross-validation. Experimental results show that the proposed method provides a higher accuracy and a relatively simple model.
  • Keywords
    adaptive estimation; cancer; gynaecology; medical diagnostic computing; random processes; regression analysis; 10-fold cross-validation; MARS procedure; breast cancer survivability prediction model; multivariate adaptive regression splines; random forest; Accuracy; Breast cancer; Mars; Mathematical model; Predictive models; Radio frequency; Breast cancer; multivariate adaptive regression spline; random forest;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electronic and Mechanical Engineering and Information Technology (EMEIT), 2011 International Conference on
  • Conference_Location
    Harbin, Heilongjiang, China
  • Print_ISBN
    978-1-61284-087-1
  • Type

    conf

  • DOI
    10.1109/EMEIT.2011.6023012
  • Filename
    6023012