Title :
Efficient Computation of the Phylogenetic Likelihood Function on the Intel MIC Architecture
Author :
Kozlov, A.M. ; Goll, C. ; Stamatakis, A.
Author_Institution :
Sci. Comput. Group, Heidelberg Inst. for Theor. Studies, Heidelberg, Germany
Abstract :
Phylogenetic inference is the process of reconstructing the evolutionary history of species based on their traits, nowadays mostly using molecular sequence data. Current state-of-the-art inference methods, like Bayesian and Maximum Likelihood (ML) inference, rely on the Phylogenetic Likelihood Function (PLF) as their computational core. Due to the large number of floating-point operations involved, the PLF evaluation is the major bottleneck for large-scale phylogenetic analyses comprising thousands of genes or even whole genomes. Here, we describe an optimized implementation of the PLF kernel for the novel Intel Many Integrated Core (MIC) architecture. Using a MIC-based accelerator (Xeon Phi 5110P), we were able to achieve speedups ranging from 1.9× to 2.8× for different PLF kernels, compared to a highly optimized AVX implementation running on dual-socket Xeon E5-2680 system. By integrating the optimized PLF into the phylogenetic inference program RAxML-Light, we reduced the overall execution times by up to factor of two. To assess the scalability on multiple Xeon Phi cards, we also developed a hybrid MPIOpenMP version of the ExaML code. When ExaML is executed on two coprocessors on the same node, we obtain speedups of up to a factor of 3.7 (vs. a CPU baseline) and 1.8 (vs. a single MIC). As expected, speedups increase with growing dataset size and become stable for alignments that require processing 1-2 million sites per MIC card.
Keywords :
biocomputing; coprocessors; floating point arithmetic; message passing; multiprocessing systems; parallel processing; ExaML code; Intel MIC architecture; Intel many integrated core architecture; MIC-based accelerator; MPIOpenMP version; PLF kernel; RAxML-Light phylogenetic inference program; Xeon Phi 5110P; coprocessors; floating-point operations; large-scale phylogenetic analyses; molecular sequence data; phylogenetic likelihood function; Coprocessors; Hardware; Kernel; Microwave integrated circuits; Phylogeny; Prefetching; Vectors; bioinformatics; phylogenetics; maximum likelihood; Intel MIC; parallel processing; MPI;
Conference_Titel :
Parallel & Distributed Processing Symposium Workshops (IPDPSW), 2014 IEEE International
Conference_Location :
Phoenix, AZ
Print_ISBN :
978-1-4799-4117-9
DOI :
10.1109/IPDPSW.2014.198