• DocumentCode
    720547
  • Title

    Optimizing the Bayesian Inference of Phylogeny on Graphic Processors

  • Author

    Cheng Ling ; Chunbao Zhou ; Arong Luo ; Guoguang Zhao ; Hamada, Tsuyoshi ; Xiaoyan Zhu

  • Author_Institution
    Guangzhou Inst. of Adv. Technol., Tsinghua Univ., Guangzhou, China
  • fYear
    2015
  • fDate
    4-7 May 2015
  • Firstpage
    333
  • Lastpage
    342
  • Abstract
    Searching for the evolutionary relationships between groups of organism has become a routine procedure in molecular biology. MrBayes is a popular model based phylogenetic inference tool using Bayesian statistics. Unfortunately, the computational cost is very high, resulting in undesirably long execution time. In this paper, we present what we believe the fastest solution of the MrBayes MC3 algorithm running on off-the-shelf graphic processors. The performance benefits are offered by the multi-granularity parallelism model, coarse-grained GPU kernel system, efficient thread arrangement strategy and GPU code level optimizations. MrBayes goMC3 (proposed herein) provides a significant performance improvement over the sequential MrBayes MC3 by a speedup of up to 48× when using single Tesla C2075 GPU card, whereas a speedup factor of 77× can be achieved when using dual GPUs. In comparison to the state-of-the-art version of other publicly available GPU implementations of MrBayes MC3, the cumulative optimizations adopted in goMC3 resulted in a speedup of up 2.5× over oMC3 (v1.0), 1.75× over tgMC3 (v1.0) and 1.46× over nMC3(v2.1.1) for realistic empirical biological datasets. Besides, experimental results indicated that goMC3 outstrips these GPU implementations on the analysis of simulated datasets composed of ultra-large-scale sequences. As a consequence, the reported performance improvement of goMC3 is significant and appears to scale well with increasing dataset sizes.
  • Keywords
    Bayes methods; bioinformatics; evolution (biological); genetics; graphics processing units; inference mechanisms; multi-threading; Bayesian inference optimization; Bayesian statistics; GPU code level optimizations; MrBayes goMC3; Tesla C2075 GPU card; coarse-grained GPU kernel system; computational cost; cumulative optimizations; empirical biological datasets; evolutionary relationship search; graphic processors; model based phylogenetic inference tool; molecular biology; multigranularity parallelism model; off-the-shelf graphic processors; performance improvement; phylogeny; sequential MrBayes MC3; speedup factor; thread arrangement strategy; ultralarge-scale sequences; Computational modeling; Graphics processing units; Instruction sets; Kernel; Parallel processing; Phylogeny; Probability; CUDA; MrBayes; phylogenetic analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on
  • Conference_Location
    Shenzhen
  • Type

    conf

  • DOI
    10.1109/CCGrid.2015.13
  • Filename
    7152499