DocumentCode
3409350
Title
Application of Relief-F feature filtering algorithm to selecting informative genes for cancer classification using microarray data
Author
Wang, Yuhang ; Makedon, Fillia
Author_Institution
Dartmouth Coll., Hanover, NH, USA
fYear
2004
fDate
16-19 Aug. 2004
Firstpage
497
Lastpage
498
Abstract
Numerous recent studies have shown that microarray gene expression data are useful for cancer classification. Classification based on microarray data differs from earlier classification problems in that the number of features (genes) greatly exceeds the number of instances (tissue samples). It has been shown that selecting a small set of informative genes can improve classification accuracy, so it is important to apply feature selection before classification. In the machine learning field, one of the most successful feature filtering algorithms is Relief-F. In this work, we empirically evaluate its performance on three published cancer classification data sets. We use a linear SVM and k-NN as classifiers and compare Relief-F with other feature filtering methods, including Information Gain, Gain Ratio, and the χ²-statistic. Under leave-one-out cross-validation, the experimental results show that the performance of Relief-F is comparable to that of the other methods.
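The pipeline the abstract describes (rank genes with Relief-F, keep the top few, classify under leave-one-out cross-validation) can be illustrated with a minimal sketch. It is not the authors' implementation. The function names, the neighbour count `k`, the range-normalised Manhattan distance, the use of every sample as a reference instance, the 1-NN classifier, the choice to rerun feature selection inside each fold, and the synthetic data are all assumptions made for illustration.

```python
import random
from collections import Counter

def manhattan(a, b, span):
    # Range-normalised Manhattan distance between two samples.
    return sum(abs(p - q) / s for p, q, s in zip(a, b, span))

def relieff_weights(X, y, k=3):
    """Return one relevance weight per feature (higher = more informative)."""
    n, d = len(X), len(X[0])
    # Per-feature range scales differences into [0, 1]; constant features get 1.0.
    span = [max(r[j] for r in X) - min(r[j] for r in X) or 1.0 for j in range(d)]
    prior = {c: cnt / n for c, cnt in Counter(y).items()}
    w = [0.0] * d
    for i in range(n):
        # All other samples, sorted by distance to sample i.
        others = sorted((manhattan(X[i], X[m], span), m) for m in range(n) if m != i)
        for c in prior:
            near = [m for _, m in others if y[m] == c][:k]
            if not near:
                continue
            # Nearest hits lower the weight; nearest misses raise it,
            # weighted by the prior of the miss class.
            scale = -1.0 if c == y[i] else prior[c] / (1.0 - prior[y[i]])
            for j in range(d):
                diff = sum(abs(X[i][j] - X[m][j]) / span[j] for m in near) / len(near)
                w[j] += scale * diff / n
    return w

def top_features(w, n_keep):
    # Indices of the n_keep highest-weighted features.
    return sorted(range(len(w)), key=lambda j: w[j], reverse=True)[:n_keep]

def loocv_accuracy(X, y, n_keep=2, k=3):
    """Leave-one-out accuracy of 1-NN on features selected within each fold."""
    correct = 0
    for i in range(len(X)):
        train_X, train_y = X[:i] + X[i + 1:], y[:i] + y[i + 1:]
        keep = top_features(relieff_weights(train_X, train_y, k), n_keep)
        # Squared Euclidean distance restricted to the selected features.
        nearest = min(range(len(train_X)),
                      key=lambda m: sum((X[i][j] - train_X[m][j]) ** 2 for j in keep))
        correct += train_y[nearest] == y[i]
    return correct / len(X)

# Synthetic stand-in for expression data (hypothetical): 20 samples, 6 "genes";
# only genes 0 and 1 differ between the two classes.
random.seed(0)
y = [0] * 10 + [1] * 10
X = [[10.0 * c + random.gauss(0, 1), 10.0 * c + random.gauss(0, 1)]
     + [random.gauss(0, 1) for _ in range(4)] for c in y]

weights = relieff_weights(X, y)
acc = loocv_accuracy(X, y)
print("top genes:", top_features(weights, 2), "LOOCV accuracy:", acc)
```

Rerunning the ranking inside each fold keeps the held-out sample from influencing which genes are selected; the abstract does not state how the original experiments handled this.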
Keywords
cancer; classification; filtering theory; genetics; learning (artificial intelligence); medical computing; support vector machines; tumours; Gain Ratio; Information Gain; Relief-F feature filtering algorithm; cancer classification; feature selection methods; k-NN; leave-one-out cross validation; linear SVM; machine learning; microarray gene expression data; selecting informative genes; tissue samples; Cancer; Filtering algorithms; Gene expression; Information filtering; Information filters; Machine learning; Machine learning algorithms; Performance gain; Support vector machine classification; Support vector machines;
fLanguage
English
Publisher
IEEE
Conference_Titel
Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
Print_ISBN
0-7695-2194-0
Type
conf
DOI
10.1109/CSB.2004.1332474
Filename
1332474