• DocumentCode
    2292283
  • Title

    A novel approach for selecting informative genes from gene expression data using Signal-to-Noise Ratio and t-statistics

  • Author

    Sahu, Barnali ; Mishra, Debahuti

  • Author_Institution
    Inst. of Tech. Educ. & Res., Siksha O Anusandhan Univ., Bhubaneswar, India
  • fYear
    2011
  • fDate
    15-17 Sept. 2011
  • Firstpage
    5
  • Lastpage
    10
  • Abstract
    Signal-to-Noise Ratio (SNR) and t-statistics are widely used for gene ranking in the analysis of microarray gene expression data. By implementing these filtering techniques directly to the microarray data may give redundant features, as we may have redundant expression values of number of genes in the data set. By grouping the genes bearing similar expression values in a single cluster and then implementing the given filtering techniques to rank the genes in each cluster and by selecting top ranked genes from each cluster give better result towards biomarker selection. In this paper we have taken four cancer data sets and k-means clustering technique to cluster the genes. Support vector machine and k-nearest Neighbor are used for classification and the method for validation is 10 fold cross validation.
  • Keywords
    cancer; filtering theory; genetics; medical signal processing; pattern classification; pattern clustering; statistical testing; support vector machines; biomarker selection; cancer data set; classification; filtering technique; gene ranking; informative genes; k-means clustering; k-nearest neighbor; microarray gene expression data; signal-to-noise ratio; support vector machine; t-statistics; Accuracy; Cancer; Clustering algorithms; Colon; Gene expression; Signal to noise ratio; Support vector machines; 10 fold cross validation; Microarray; Signal-to-Noise Ratio; biomarker; classification; k-means; k-nearest neighbor; support vector machine; t-statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Communication Technology (ICCCT), 2011 2nd International Conference on
  • Conference_Location
    Allahabad
  • Print_ISBN
    978-1-4577-1385-9
  • Type

    conf

  • DOI
    10.1109/ICCCT.2011.6075207
  • Filename
    6075207