Title :
Effective Gene Selection Method With Small Sample Sets Using Gradient-Based and Point Injection Techniques
Author :
Huang, D. ; Chow, Tommy W S
Author_Institution :
City Univ. of Hong Kong, Hong Kong
Abstract :
Microarray gene expression data usually consist of a large amount of genes. Among these genes, only a small fraction are informative for performing a cancer diagnostic test. This paper focuses on effective identification of informative genes. We analyze gene selection models from the perspective of optimization theory. As a result, a new strategy is designed to modify conventional search engines. Also, as overfitting is likely to occur in microarray data because of their small sample set, a point injection technique is developed to address the problem of overfitting. The proposed strategies have been evaluated on three kinds of cancer diagnosis. Our results show that the proposed strategies can improve the performance of gene selection substantially. The experimental results also indicate that the proposed methods are very robust under all of the investigated cases.
Keywords :
cancer; genetic engineering; gradient methods; medical computing; optimisation; patient diagnosis; search engines; cancer diagnostic test; gene selection method; gene selection models; gradient based techniques; informative gene identification; microarray gene expression data; optimization theory; overfitting; point injection techniques; search engines; DNA; Data analysis; Filters; Gene expression; Performance evaluation; Prostate cancer; Robustness; Search engines; Stochastic processes; Testing; gene selection; gradient based learning; optimization theory; point injection; Algorithms; Artificial Intelligence; Diagnosis, Computer-Assisted; Gene Expression Profiling; Humans; Neoplasm Proteins; Neoplasms; Oligonucleotide Array Sequence Analysis; Pattern Recognition, Automated; Sample Size; Signal Processing, Computer-Assisted; Tumor Markers, Biological;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/tcbb.2007.1021