Title :
Improving Microarray Sample Size Using Bootstrap Data Combination
Author :
Phan, John H. ; Moffitt, Richard A. ; Barrett, Andrea B. ; Wang, May D.
Author_Institution :
Dept. of Biomed. Eng., Georgia Inst. of Technol., GA
Abstract :
Microarray technology has enabled us to simultaneously measure the expression of thousands of genes. Using this high-throughput technology, we can examine subtle genetic changes between biological samples and build predictive models for clinical applications. Although microarrays have dramatically increased the rate of data collection, sample size is still a major issue when selecting features. Previous methods show that combining multiple microarray datasets improves feature selection using simple methods such as fold change. We propose a wrapper-based gene selection technique that combines bootstrap estimated classification errors for individual genes across multiple datasets and reduces the contribution of datasets with high variance. We use the bootstrap because it is an unbiased estimator of classification error that is also effective for small sample data. Coupled with data combination across multiple datasets, we show that our meta-analytic approach improves the biological relevance of gene selection using prostate and renal cancer microarray data.
Keywords :
arrays; biological organs; biology computing; cancer; data communication; data handling; genetics; genomics; biological samples; bootstrap data combination; clinical applications; data collection; feature selection; gene expression; high-throughput technology; microarray sample size; multiple datasets; prostate cancer; renal cancer microarray data; unbiased estimator; wrapper-based gene selection technique; Biomarkers; Biomedical computing; Biomedical engineering; Biomedical measurements; Cancer; Filtering; Filters; Gene expression; Reproducibility of results; Size measurement; biomarker identification; data combination; microarray analysis;
Conference_Titel :
Computer and Computational Sciences, 2008. IMSCCS '08. International Multisymposiums on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3430-5
DOI :
10.1109/IMSCCS.2008.36