• DocumentCode
    2459127
  • Title

    Improving Microarray Sample Size Using Bootstrap Data Combination

  • Author

    Phan, John H. ; Moffitt, Richard A. ; Barrett, Andrea B. ; Wang, May D.

  • Author_Institution
    Dept. of Biomed. Eng., Georgia Inst. of Technol., GA
  • fYear
    2008
  • fDate
    18-20 Oct. 2008
  • Firstpage
    37
  • Lastpage
    44
  • Abstract
    Microarray technology has enabled us to simultaneously measure the expression of thousands of genes. Using this high-throughput technology, we can examine subtle genetic changes between biological samples and build predictive models for clinical applications. Although microarrays have dramatically increased the rate of data collection, sample size is still a major issue when selecting features. Previous methods show that combining multiple microarray datasets improves feature selection using simple methods such as fold change. We propose a wrapper-based gene selection technique that combines bootstrap estimated classification errors for individual genes across multiple datasets and reduces the contribution of datasets with high variance. We use the bootstrap because it is an unbiased estimator of classification error that is also effective for small sample data. Coupled with data combination across multiple datasets, we show that our meta-analytic approach improves the biological relevance of gene selection using prostate and renal cancer microarray data.
  • Keywords
    arrays; biological organs; biology computing; cancer; data communication; data handling; genetics; genomics; biological samples; bootstrap data combination; clinical applications; data collection; feature selection; gene expression; high-throughput technology; microarray sample size; multiple datasets; prostate cancer; renal cancer microarray data; unbiased estimator; wrapper-based gene selection technique; Biomarkers; Biomedical computing; Biomedical engineering; Biomedical measurements; Cancer; Filtering; Filters; Gene expression; Reproducibility of results; Size measurement; biomarker identification; data combination; microarray analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Computational Sciences, 2008. IMSCCS '08. International Multisymposiums on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-0-7695-3430-5
  • Type

    conf

  • DOI
    10.1109/IMSCCS.2008.36
  • Filename
    4760294