DocumentCode :
3090981
Title :
accULL: An User-directed Approach to Heterogeneous Programming
Author :
Reyes, Ruymán ; López, Ivan ; Fumero, Juan J. ; De Sande, Francisco
Author_Institution :
Dept. de EIO y Comput., Univ. de La Laguna, La Laguna, Spain
fYear :
2012
fDate :
10-13 July 2012
Firstpage :
654
Lastpage :
661
Abstract :
The world of HPC is undergoing rapid change, and the range of computer architectures capable of achieving high performance has broadened. The arrival of computational accelerators such as GPUs is increasing performance while maintaining a low cost per GFLOP, thus expanding the popularity of HPC. However, it is still difficult to exploit the new, complex processor hierarchies. Adapting the message-passing model to program heterogeneous CPU+GPU environments is not an easy task. Furthermore, message passing does not seem to be the best choice from the programmer's point of view. Traditional shared-memory approaches like OpenMP are attractive for easing the adoption of these platforms, but GPU devices are connected to the CPU through a bus and have a separate memory space. We need to find a way to deal with this issue at the programming-language level; otherwise, developers will spend most of their time on low-level code details instead of algorithmic enhancements. The recent advent of the OpenACC standard for heterogeneous computing represents an effort toward reducing the development effort. This initiative, combined with future releases of the OpenMP standard, will converge into a fully heterogeneous framework that will cope with the programming requirements of future computer architectures. In this work we present preliminary results of accULL, a novel implementation of the OpenACC standard based on a source-to-source compiler and a runtime library. To our knowledge, our approach is the first to provide support for both OpenCL and CUDA platforms under this new standard.
Keywords :
application program interfaces; graphics processing units; parallel programming; program compilers; shared memory systems; software libraries; GFLOP; GPU; HPC; OpenACC standard; OpenMP; accULL; computational accelerators; computer architectures; graphics processing unit; heterogeneous programming; high performance computing; memory space; processor hierarchies; programming language level; runtime library; shared memory approaches; source-to-source compiler; user-directed approach; Computer architecture; Graphics processing unit; Kernel; Programming; Runtime; Standards; Accelerators; CUDA; GPGPU; OpenACC; OpenCL; OpenMP; code performance; compiler; productivity;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Parallel and Distributed Processing with Applications (ISPA), 2012 IEEE 10th International Symposium on
Conference_Location :
Leganes
Print_ISBN :
978-1-4673-1631-6
Type :
conf
DOI :
10.1109/ISPA.2012.97
Filename :
6280357