• DocumentCode
    79504
  • Title

    Parametric Estimation of the Local False Discovery Rate for Identifying Genetic Associations

  • Author

    Ye Yang ; Aghababazadeh, Farnoosh Abbas ; Bickel, David R.

  • Author_Institution
    Bank of Nova Scotia (Scotiabank), Toronto, ON, Canada
  • Volume
    10
  • Issue
    1
  • fYear
    2013
  • fDate
    Jan.-Feb. 2013
  • Firstpage
    98
  • Lastpage
    108
  • Abstract
    Many genome-wide association studies have been conducted to identify single nucleotide polymorphisms (SNPs) that are associated with particular diseases or other traits. The local false discovery rate (LFDR) estimated using semiparametric models has enjoyed success in simultaneous inference. However, semiparametric LFDR estimators can be biased because they tend to overestimate the proportion of the nonassociated SNPs. We address the problem by adapting a simple parametric mixture model (PMM) and by comparing this model to the semiparametric mixture model (SMM) behind an LFDR estimator that is known to be conservatively biased. Then, we also compare the PMM with a parametric nonmixture model (PNM). In our simulation studies, we thoroughly analyze the performances of the three models under different values of p1, a prior probability that is approximately equal to the proportion of SNPs that are associated with the disease. When p1 > 10%, the PMM generally performs better than the SMM. When p1 <; 0:1%, the SMM outperforms PMM. When p1 lies between 0.1 and 10 percent, both methods have about the same performance. In that setting, the PMM may be preferred since it has the advantage of supplying an estimate of the detectability level of the nonassociated SNPs.
  • Keywords
    bioinformatics; diseases; genomics; polymorphism; LFDR estimation; disease; genetic association identification; genome wide association; local false discovery rate; parametric estimation; parametric nonmixture model; semiparametric mixture model; semiparametric model; simple parametric mixture model; single nucleotide polymorphism; Adaptation models; Analytical models; Bioinformatics; Diseases; Estimation; Solid modeling; Standards; Empirical Bayes; MDL; Type II maximum likelihood; genome-wide association studies; local false discovery rate; minimum description length; reduced likelihood; strength of statistical evidence; Bayes Theorem; Computational Biology; Computer Simulation; Genome-Wide Association Study; Humans; Models, Genetic; Models, Statistical; Polymorphism, Single Nucleotide;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2012.140
  • Filename
    6365171