DocumentCode :
3409350
Title :
Application of Relief-F feature filtering algorithm to selecting informative genes for cancer classification using microarray data
Author :
Wang, Yuhang ; Makedon, Fillia
Author_Institution :
Dartmouth Coll., Hanover, NH, USA
fYear :
2004
fDate :
16-19 Aug. 2004
Firstpage :
497
Lastpage :
498
Abstract :
Numerous recent studies have shown that microarray gene expression data is useful for cancer classification. Classification based on microarray data is very different from previous classification problems in that the number of features (genes) greatly exceeds the number of instances (tissue samples). It has been shown that selecting a small set of informative genes can lead to improved classification accuracy. It is thus important to first apply feature selection methods prior to classification. In the machine learning field, one of the most successful feature filtering algorithms is the Relief-F algorithm. In this work, we empirically evaluate its performance on three published cancer classification data sets. We use the linear SVM and the k-NN as classifiers in the experiments, and compare the performance of Relief-F with other feature filtering methods, including Information Gain, Gain Ratio, and χ2-statistic. Using the leave-one-out cross validation, experimental results show that the performance of Relief-F is comparable with other methods.
Keywords :
cancer; classification; filtering theory; genetics; learning (artificial intelligence); medical computing; support vector machines; tumours; Gain Ratio; Information Gain; Relief-F feature filtering algorithm; cancer classification; feature selection methods; k-NN; leave-one-out cross validation; linear SVM; machine learning; microarray gene expression data; selecting informative genes; tissue samples; Cancer; Filtering algorithms; Gene expression; Information filtering; Information filters; Machine learning; Machine learning algorithms; Performance gain; Support vector machine classification; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
Print_ISBN :
0-7695-2194-0
Type :
conf
DOI :
10.1109/CSB.2004.1332474
Filename :
1332474
Link To Document :
بازگشت