DocumentCode :
720547
Title :
Optimizing the Bayesian Inference of Phylogeny on Graphic Processors
Author :
Cheng Ling ; Chunbao Zhou ; Arong Luo ; Guoguang Zhao ; Hamada, Tsuyoshi ; Xiaoyan Zhu
Author_Institution :
Guangzhou Inst. of Adv. Technol., Tsinghua Univ., Guangzhou, China
fYear :
2015
fDate :
4-7 May 2015
Firstpage :
333
Lastpage :
342
Abstract :
Searching for the evolutionary relationships between groups of organism has become a routine procedure in molecular biology. MrBayes is a popular model based phylogenetic inference tool using Bayesian statistics. Unfortunately, the computational cost is very high, resulting in undesirably long execution time. In this paper, we present what we believe the fastest solution of the MrBayes MC3 algorithm running on off-the-shelf graphic processors. The performance benefits are offered by the multi-granularity parallelism model, coarse-grained GPU kernel system, efficient thread arrangement strategy and GPU code level optimizations. MrBayes goMC3 (proposed herein) provides a significant performance improvement over the sequential MrBayes MC3 by a speedup of up to 48× when using single Tesla C2075 GPU card, whereas a speedup factor of 77× can be achieved when using dual GPUs. In comparison to the state-of-the-art version of other publicly available GPU implementations of MrBayes MC3, the cumulative optimizations adopted in goMC3 resulted in a speedup of up 2.5× over oMC3 (v1.0), 1.75× over tgMC3 (v1.0) and 1.46× over nMC3(v2.1.1) for realistic empirical biological datasets. Besides, experimental results indicated that goMC3 outstrips these GPU implementations on the analysis of simulated datasets composed of ultra-large-scale sequences. As a consequence, the reported performance improvement of goMC3 is significant and appears to scale well with increasing dataset sizes.
Keywords :
Bayes methods; bioinformatics; evolution (biological); genetics; graphics processing units; inference mechanisms; multi-threading; Bayesian inference optimization; Bayesian statistics; GPU code level optimizations; MrBayes goMC3; Tesla C2075 GPU card; coarse-grained GPU kernel system; computational cost; cumulative optimizations; empirical biological datasets; evolutionary relationship search; graphic processors; model based phylogenetic inference tool; molecular biology; multigranularity parallelism model; off-the-shelf graphic processors; performance improvement; phylogeny; sequential MrBayes MC3; speedup factor; thread arrangement strategy; ultralarge-scale sequences; Computational modeling; Graphics processing units; Instruction sets; Kernel; Parallel processing; Phylogeny; Probability; CUDA; MrBayes; phylogenetic analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on
Conference_Location :
Shenzhen
Type :
conf
DOI :
10.1109/CCGrid.2015.13
Filename :
7152499
Link To Document :
بازگشت