DocumentCode
1092787
Title
Effective Gene Selection Method With Small Sample Sets Using Gradient-Based and Point Injection Techniques
Author
Huang, D. ; Chow, Tommy W S
Author_Institution
City Univ. of Hong Kong, Hong Kong
Volume
4
Issue
3
fYear
2007
Firstpage
467
Lastpage
475
Abstract
Microarray gene expression data usually consist of a large amount of genes. Among these genes, only a small fraction are informative for performing a cancer diagnostic test. This paper focuses on effective identification of informative genes. We analyze gene selection models from the perspective of optimization theory. As a result, a new strategy is designed to modify conventional search engines. Also, as overfitting is likely to occur in microarray data because of their small sample set, a point injection technique is developed to address the problem of overfitting. The proposed strategies have been evaluated on three kinds of cancer diagnosis. Our results show that the proposed strategies can improve the performance of gene selection substantially. The experimental results also indicate that the proposed methods are very robust under all of the investigated cases.
Keywords
cancer; genetic engineering; gradient methods; medical computing; optimisation; patient diagnosis; search engines; cancer diagnostic test; gene selection method; gene selection models; gradient based techniques; informative gene identification; microarray gene expression data; optimization theory; overfitting; point injection techniques; search engines; DNA; Data analysis; Filters; Gene expression; Performance evaluation; Prostate cancer; Robustness; Search engines; Stochastic processes; Testing; gene selection; gradient based learning; optimization theory; point injection; Algorithms; Artificial Intelligence; Diagnosis, Computer-Assisted; Gene Expression Profiling; Humans; Neoplasm Proteins; Neoplasms; Oligonucleotide Array Sequence Analysis; Pattern Recognition, Automated; Sample Size; Signal Processing, Computer-Assisted; Tumor Markers, Biological;
fLanguage
English
Journal_Title
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher
ieee
ISSN
1545-5963
Type
jour
DOI
10.1109/tcbb.2007.1021
Filename
4288072
Link To Document