Title :
A novel approach for selecting informative genes from gene expression data using Signal-to-Noise Ratio and t-statistics
Author :
Sahu, Barnali ; Mishra, Debahuti
Author_Institution :
Inst. of Tech. Educ. & Res., Siksha O Anusandhan Univ., Bhubaneswar, India
Abstract :
Signal-to-Noise Ratio (SNR) and t-statistics are widely used for gene ranking in the analysis of microarray gene expression data. By implementing these filtering techniques directly to the microarray data may give redundant features, as we may have redundant expression values of number of genes in the data set. By grouping the genes bearing similar expression values in a single cluster and then implementing the given filtering techniques to rank the genes in each cluster and by selecting top ranked genes from each cluster give better result towards biomarker selection. In this paper we have taken four cancer data sets and k-means clustering technique to cluster the genes. Support vector machine and k-nearest Neighbor are used for classification and the method for validation is 10 fold cross validation.
Keywords :
cancer; filtering theory; genetics; medical signal processing; pattern classification; pattern clustering; statistical testing; support vector machines; biomarker selection; cancer data set; classification; filtering technique; gene ranking; informative genes; k-means clustering; k-nearest neighbor; microarray gene expression data; signal-to-noise ratio; support vector machine; t-statistics; Accuracy; Cancer; Clustering algorithms; Colon; Gene expression; Signal to noise ratio; Support vector machines; 10 fold cross validation; Microarray; Signal-to-Noise Ratio; biomarker; classification; k-means; k-nearest neighbor; support vector machine; t-statistics;
Conference_Titel :
Computer and Communication Technology (ICCCT), 2011 2nd International Conference on
Conference_Location :
Allahabad
Print_ISBN :
978-1-4577-1385-9
DOI :
10.1109/ICCCT.2011.6075207