DocumentCode
3038149
Title
Support vectors based correlation coefficient for gene and sample selection in cancer classification
Author
Mundra, Piyushkumar A. ; Rajapakse, Jagath C.
Author_Institution
Biolnformatics Res. Center, Nanyang Technol. Univ., Singapore, Singapore
fYear
2010
fDate
2-5 May 2010
Firstpage
1
Lastpage
7
Abstract
Correlation is a very widely used filter criterion for gene selection in cancer classification. However, it uses all the training samples in ranking, which may not be equally important for the classification. Using support vectors, we demonstrate that classical correlation coefficient based gene selection is biased because of the sample points away from classification margin. To remove such bias, we use only the support vectors for computation of correlation coefficient and propose a backward elimination based SVcc-RFE algorithm. The proposed method is tested on several benchmark cancer gene expression datasets and the results show improvement in classification performance compared to other state-of-the-art methods.
Keywords
bioinformatics; cancer; pattern classification; support vector machines; SVcc-RFE algorithm; backward elimination; cancer classification; correlation coefficient; gene selection; sample selection; support vectors; Benchmark testing; Cancer; Costs; DNA; Data mining; Filters; Gene expression; Partitioning algorithms; Size measurement; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2010 IEEE Symposium on
Conference_Location
Montreal, QC
Print_ISBN
978-1-4244-6766-2
Type
conf
DOI
10.1109/CIBCB.2010.5510689
Filename
5510689
Link To Document