DocumentCode :
2035787
Title :
A signal-to-noise classification model for identification of differentially expressed genes from gene expression data
Author :
Mishra, Debahuti ; Sahu, Barnali
Author_Institution :
Inst. of Tech. Educ. & Res., Siksha O Anusandhan Univ., Bhubaneswar, India
Volume :
2
fYear :
2011
fDate :
8-10 April 2011
Firstpage :
204
Lastpage :
208
Abstract :
A major focus in cancer research is identifying genetic markers or biomarkers. To build a robust classifier we have to find out the differentially expressed genes (key genes) in binary classification. The differentially expressed genes or biomarker gene selection is the preprocessing task for cancer classification. In this paper, we have compared the results of two approaches for selecting biomarkers from Leukemia data set. The first approach for feature selection is by implementing k-means clustering and signal-to-noise ratio (SNR) method for gene ranking, the top scored genes from each cluster is selected and given to the classifiers. The second approach uses signal to noise ratio ranking only for feature selection. For validation of both the approaches, we have used k nearest neighbor (kNN), support vector machine (SVM), probabilistic Neural Network (PNN) and Feed Forward Neural Network (fNN). After comparing the final results of two approaches we have got 100%, 96%and 96% accuracy with SVM, kNN and PNN respectively in first approach with five numbers of genes. Whereas, performance of FNN is 2.17 with 10 numbers of genes. In second approach we have got 96%, 96% and 62% accuracies for SVM, kNN and PNN respectively for 5 numbers of genes and the performance of FNN is 2.52 for 10 genes.
Keywords :
cancer; feature extraction; feedforward neural nets; genetic algorithms; learning (artificial intelligence); pattern classification; pattern clustering; support vector machines; biomarker gene selection; cancer classification; differentially expressed gene; feature selection; feedforward neural network; gene expression data; genetic marker; k nearest neighbor; leukemia data set; probabilistic neural network; robust classifier; signal-to-noise classification model; support vector machine; Accuracy; Artificial neural networks; Cancer; Clustering algorithms; Feeds; Signal to noise ratio; Support vector machines; Differentially Expressed Genes; Feature Selection; Feed forward neural network; K-means; Probabilistic neural network; Signal to Noise ratio; Support vector machine; k-Nearest Neighbor;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electronics Computer Technology (ICECT), 2011 3rd International Conference on
Conference_Location :
Kanyakumari
Print_ISBN :
978-1-4244-8678-6
Electronic_ISBN :
978-1-4244-8679-3
Type :
conf
DOI :
10.1109/ICECTECH.2011.5941685
Filename :
5941685
Link To Document :
بازگشت