• DocumentCode
    3673638
  • Title

    Building an Effective Classification Model for Breast Cancer Patient Response Data

  • Author

    Brian Heredia;Taghi M. Khoshgoftaar;Alireza Fazelpour;David J. Dittman

  • Author_Institution
    Florida Atlantic Univ., Boca Raton, FL, USA
  • fYear
    2015
  • Firstpage
    229
  • Lastpage
    235
  • Abstract
    Choosing an appropriate cancer treatment is potentially the most important task in the treatment of a cancer patient. If it were possible to identify the best option for a patient (or at minimum to remove options that will not help the patient), then the general prognosis of the patient improves. However, this task becomes much more subtle due to characteristics such as high dimensionality found in many gene expression datasets. In this study, we seek to identify classifiers and feature selection techniques best suited for predicting a breast cancer patient´s response to a cancer treatment. In order to determine this, we have collected a group of five high-dimensional breast cancer patient response datasets and use a group of four classifiers, and three feature selection techniques along with four feature subset sizes. Our results show that 5-Nearest Neighbor classifier and Signal-to-Noise feature selection technique are the most frequently top performing techniques. Statistical analysis confirms that these techniques are the top performing techniques. Thus, we recommend the use of 5-Nearest Neighbor and Signal-to-Noise for breast cancer patient response data. To our knowledge, this is the first study that focuses on the classification process on patient response data for breast cancer.
  • Keywords
    "Breast cancer","Support vector machines","Data models","Medical treatment","Measurement","Buildings"
  • Publisher
    ieee
  • Conference_Titel
    Information Reuse and Integration (IRI), 2015 IEEE International Conference on
  • Type

    conf

  • DOI
    10.1109/IRI.2015.46
  • Filename
    7300982