• DocumentCode
    1784875
  • Title

    A heterogeneous compute solution for optimized genomic selection analysis

  • Author

    DeVore, Trevor ; Winkleblack, Scott ; Golden, Bruce ; Lupo, Chris

  • Author_Institution
    California Polytech. State Univ., San Luis, AZ, USA
  • fYear
    2014
  • fDate
    2-5 Nov. 2014
  • Firstpage
    363
  • Lastpage
    370
  • Abstract
    This paper presents a heterogeneous computing solution for an optimized genetic selection analysis tool, GenSel. GenSel can be used to efficiently infer the effects of genetic markers on a desired trait or to determine the genomic estimated breeding values (GEBV) of genotyped individuals. To predict which genetic markers are informational, GenSel performs Bayesian inference using Gibbs sampling, a Markov Chain Monte Carlo (MCMC) algorithm. Parallelizing this algorithm proves to be a technically challenging problem because there exists a loop carried dependence between each iteration of the Markov chain. The approach presented in this paper exploits both task-level parallelism (TLP) and data-level parallelism (DLP) that exists within each iteration of the Markov chain. More specifically, a combination of CPU threads using OpenMP and GPU threads using NVIDIA´s CUDA paradigm is implemented to speed up the sampling of each genetic marker used in creating the model. Performance speedup will allow this algorithm to accommodate the expected increase in observations on animals and genetic markers per observation. The current implementation executes 1.84 times faster than the optimized CPU implementation.
  • Keywords
    Bayes methods; Markov processes; Monte Carlo methods; bioinformatics; genetics; genomics; graphics processing units; iterative methods; parallel algorithms; parallel architectures; Bayesian inference; CPU threads; DLP; GEBV; GPU threads; GenSel; Gibbs sampling; MCMC; Markov Chain Monte Carlo algorithm; Markov chain iteration; NVIDIA´s CUDA paradigm; OpenMP; TLP; data-level parallelism; genetic marker sampling; genomic estimated breeding values; genotyped individuals; heterogeneous compute solution; heterogeneous computing solution; loop carried dependence; optimized CPU implementation; optimized genetic selection analysis tool; optimized genomic selection analysis; performance speedup; task-level parallelism; Algorithm design and analysis; Computational modeling; Data models; Genetics; Graphics processing units; Parallel processing; Vectors; Bayes methods; GPU; Genetic Selection; Heterogeneous computing; Markov processes; Monte Carlo methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine (BIBM), 2014 IEEE International Conference on
  • Conference_Location
    Belfast
  • Type

    conf

  • DOI
    10.1109/BIBM.2014.6999186
  • Filename
    6999186