• DocumentCode
    1092787
  • Title

    Effective Gene Selection Method With Small Sample Sets Using Gradient-Based and Point Injection Techniques

  • Author

    Huang, D. ; Chow, Tommy W S

  • Author_Institution
    City Univ. of Hong Kong, Hong Kong
  • Volume
    4
  • Issue
    3
  • fYear
    2007
  • Firstpage
    467
  • Lastpage
    475
  • Abstract
    Microarray gene expression data usually consist of a large amount of genes. Among these genes, only a small fraction are informative for performing a cancer diagnostic test. This paper focuses on effective identification of informative genes. We analyze gene selection models from the perspective of optimization theory. As a result, a new strategy is designed to modify conventional search engines. Also, as overfitting is likely to occur in microarray data because of their small sample set, a point injection technique is developed to address the problem of overfitting. The proposed strategies have been evaluated on three kinds of cancer diagnosis. Our results show that the proposed strategies can improve the performance of gene selection substantially. The experimental results also indicate that the proposed methods are very robust under all of the investigated cases.
  • Keywords
    cancer; genetic engineering; gradient methods; medical computing; optimisation; patient diagnosis; search engines; cancer diagnostic test; gene selection method; gene selection models; gradient based techniques; informative gene identification; microarray gene expression data; optimization theory; overfitting; point injection techniques; search engines; DNA; Data analysis; Filters; Gene expression; Performance evaluation; Prostate cancer; Robustness; Search engines; Stochastic processes; Testing; gene selection; gradient based learning; optimization theory; point injection; Algorithms; Artificial Intelligence; Diagnosis, Computer-Assisted; Gene Expression Profiling; Humans; Neoplasm Proteins; Neoplasms; Oligonucleotide Array Sequence Analysis; Pattern Recognition, Automated; Sample Size; Signal Processing, Computer-Assisted; Tumor Markers, Biological;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/tcbb.2007.1021
  • Filename
    4288072