• DocumentCode
    472208
  • Title

    A Fast Boosting-Based Screening Method for Large-scale Association Study in Complex Traits with Genetic Heterogeneity

  • Author

    Wang, Lu-yong ; Fasulo, Daniel

  • Author_Institution
    Integrated Data Syst. Dept., Siemens Corp. Res. Inc., Princeton, NJ
  • fYear
    2006
  • fDate
    Aug. 30 2006-Sept. 3 2006
  • Firstpage
    5771
  • Lastpage
    5774
  • Abstract
    Genome-wide association study for complex diseases will generate massive amount of single nucleotide polymorphisms (SNPs) data. Univariate statistical test (i.e. Fisher exact test) was used to single out non-associated SNPs. However, the disease-susceptible SNPs may have little marginal effects in population and are unlikely to retain after the univariate tests. Also, model-based methods are impractical for large-scale dataset. Moreover, genetic heterogeneity makes the traditional methods harder to identify the genetic causes of diseases. A more recent random forest method provides a more robust method for screening the SNPs in thousands scale. However, for more large-scale data, i.e., Affymetrix Human Mapping 100K GeneChip data, a faster screening method is required to screening SNPs in whole-genome large scale association analysis with genetic heterogeneity. We propose a boosting-based method for rapid screening in large-scale analysis of complex traits in the presence of genetic heterogeneity. It provides a relatively fast and fairly good tool for screening and limiting the candidate SNPs for further more complex computational modeling task
  • Keywords
    cellular biophysics; computational complexity; diseases; genetics; medical computing; molecular biophysics; statistical analysis; Fisher exact test; affymetrix human mapping 100K GeneChip data; boosting-based screening method; complex disease; disease-susceptible SNP; genetic heterogeneity; genome-wide association study; single nucleotide polymorphism; univariate statistical test; Bayesian methods; Breast cancer; Cities and towns; Computational complexity; Diseases; Genetics; Large-scale systems; Plasma displays; Testing; USA Councils;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering in Medicine and Biology Society, 2006. EMBS '06. 28th Annual International Conference of the IEEE
  • Conference_Location
    New York, NY
  • ISSN
    1557-170X
  • Print_ISBN
    1-4244-0032-5
  • Electronic_ISBN
    1557-170X
  • Type

    conf

  • DOI
    10.1109/IEMBS.2006.260585
  • Filename
    4463118