DocumentCode
2292283
Title
A novel approach for selecting informative genes from gene expression data using Signal-to-Noise Ratio and t-statistics
Author
Sahu, Barnali ; Mishra, Debahuti
Author_Institution
Inst. of Tech. Educ. & Res., Siksha O Anusandhan Univ., Bhubaneswar, India
fYear
2011
fDate
15-17 Sept. 2011
Firstpage
5
Lastpage
10
Abstract
Signal-to-Noise Ratio (SNR) and t-statistics are widely used for gene ranking in the analysis of microarray gene expression data. By implementing these filtering techniques directly to the microarray data may give redundant features, as we may have redundant expression values of number of genes in the data set. By grouping the genes bearing similar expression values in a single cluster and then implementing the given filtering techniques to rank the genes in each cluster and by selecting top ranked genes from each cluster give better result towards biomarker selection. In this paper we have taken four cancer data sets and k-means clustering technique to cluster the genes. Support vector machine and k-nearest Neighbor are used for classification and the method for validation is 10 fold cross validation.
Keywords
cancer; filtering theory; genetics; medical signal processing; pattern classification; pattern clustering; statistical testing; support vector machines; biomarker selection; cancer data set; classification; filtering technique; gene ranking; informative genes; k-means clustering; k-nearest neighbor; microarray gene expression data; signal-to-noise ratio; support vector machine; t-statistics; Accuracy; Cancer; Clustering algorithms; Colon; Gene expression; Signal to noise ratio; Support vector machines; 10 fold cross validation; Microarray; Signal-to-Noise Ratio; biomarker; classification; k-means; k-nearest neighbor; support vector machine; t-statistics;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Communication Technology (ICCCT), 2011 2nd International Conference on
Conference_Location
Allahabad
Print_ISBN
978-1-4577-1385-9
Type
conf
DOI
10.1109/ICCCT.2011.6075207
Filename
6075207
Link To Document