• DocumentCode
    3409350
  • Title

    Application of Relief-F feature filtering algorithm to selecting informative genes for cancer classification using microarray data

  • Author

    Wang, Yuhang ; Makedon, Fillia

  • Author_Institution
    Dartmouth Coll., Hanover, NH, USA
  • fYear
    2004
  • fDate
    16-19 Aug. 2004
  • Firstpage
    497
  • Lastpage
    498
  • Abstract
    Numerous recent studies have shown that microarray gene expression data is useful for cancer classification. Classification based on microarray data is very different from previous classification problems in that the number of features (genes) greatly exceeds the number of instances (tissue samples). It has been shown that selecting a small set of informative genes can lead to improved classification accuracy. It is thus important to first apply feature selection methods prior to classification. In the machine learning field, one of the most successful feature filtering algorithms is the Relief-F algorithm. In this work, we empirically evaluate its performance on three published cancer classification data sets. We use the linear SVM and the k-NN as classifiers in the experiments, and compare the performance of Relief-F with other feature filtering methods, including Information Gain, Gain Ratio, and χ2-statistic. Using the leave-one-out cross validation, experimental results show that the performance of Relief-F is comparable with other methods.
  • Keywords
    cancer; classification; filtering theory; genetics; learning (artificial intelligence); medical computing; support vector machines; tumours; Gain Ratio; Information Gain; Relief-F feature filtering algorithm; cancer classification; feature selection methods; k-NN; leave-one-out cross validation; linear SVM; machine learning; microarray gene expression data; selecting informative genes; tissue samples; Cancer; Filtering algorithms; Gene expression; Information filtering; Information filters; Machine learning; Machine learning algorithms; Performance gain; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
  • Print_ISBN
    0-7695-2194-0
  • Type

    conf

  • DOI
    10.1109/CSB.2004.1332474
  • Filename
    1332474