DocumentCode
592033
Title
An Automatic Host and Device Memory Allocation Method for OpenMPC
Author
Uchiyama, Hiroyuki ; Tsumura, Tomoaki ; Matsuo, Hiroshi
Author_Institution
Nagoya Inst. of Technol., Nagoya, Japan
fYear
2012
fDate
5-7 Dec. 2012
Firstpage
208
Lastpage
214
Abstract
The CUDA programming model provides better abstraction for GPU programming. However, it is still hard to write programs with CUDA because both some specific techniques and knowledge about GPU architecture are required. Hence, many programming frameworks for CUDA have been developed. OpenMPC is one of them based on OpenMP. OpenMPC is an easy-to-write framework for programmers familiar with traditional OpenMP, but still requires programmers to use the special directives for utilizing fast device memories. To solve this problem, this paper proposes a method for allocating appropriate device memories automatically. This paper also proposes a method for automatically allocating page locked memory for the data which are transferred between host and device. The evaluation results with several programs show that proposed methods can reduce 52% execution time in maximum.
Keywords
graphics processing units; paged storage; parallel architectures; parallel programming; program compilers; program diagnostics; storage allocation; CUDA programming model; GPU architecture; GPU programming abstraction; OpenMPC; automatic device memory allocation method; automatic host allocation method; automatic page locked memory allocation method; graphics processing unit; programming frameworks; Arrays; Data transfer; Graphics processing units; Instruction sets; Kernel; Programming; Resource management; CUDA; GPGPU; OpenMPC; memory allocation;
fLanguage
English
Publisher
ieee
Conference_Titel
Networking and Computing (ICNC), 2012 Third International Conference on
Conference_Location
Okinawa
Print_ISBN
978-1-4673-4624-5
Type
conf
DOI
10.1109/ICNC.2012.39
Filename
6424565
Link To Document