Title :
Optimization Techniques and Performance Analyses of Two Life Science Algorithms for Novel GPU Architectures
Author :
Dilch, David ; Mehofer, Eduard
Author_Institution :
Res. Group Sci. Comput., Univ. of Vienna, Vienna, Austria
Abstract :
In this paper we evaluate two life science algorithms, namely Needleman-Wunsch sequence alignment and Direct Coulomb Summation, for GPUs. Whereas for Needleman-Wunsch it is difficult to get good performance numbers, Direct Coulomb Summation is particularly suitable for graphics cards. We present several optimization techniques, analyze the theoretical potential of the optimizations with respect to the algorithms, and measure the effect on execution times. We target the recent NVIDIA Fermi architecture to evaluate the performance impacts of novel hardware features like the cache subsystem on optimizing transformations. We compare the execution times of CUDA and OpenCL code versions for Fermi and predecessor models with parallel OpenMP versions executed on the main CPU.
Keywords :
cache storage; graphics processing units; optimisation; parallel architectures; performance evaluation; CUDA; GPU architectures; NVIDIA Fermi architecture; Needleman-Wunsch sequence alignment; OpenCL code versions; cache subsystem; direct Coulomb summation; graphics cards; life science algorithms; main CPU; optimization techniques; parallel OpenMP versions; performance analyses; Computer architecture; Graphics processing unit; Hardware; Instruction sets; Kernel; Optimization; Synchronization; CUDA; Fermi; OpenCL; life science algorithms; optimizations; performance evaluation;
Conference_Titel :
Parallel, Distributed and Network-Based Processing (PDP), 2012 20th Euromicro International Conference on
Conference_Location :
Garching
Print_ISBN :
978-1-4673-0226-5
DOI :
10.1109/PDP.2012.87