DocumentCode
1955455
Title
Cooperative heterogeneous computing for parallel processing on CPU/GPU hybrids
Author
Lee, Changmin ; Ro, Won W. ; Gaudiot, Jean-Luc
Author_Institution
Sch. of Electr. & Electron. Eng., Yonsei Univ., Seoul, South Korea
fYear
2012
fDate
25-25 Feb. 2012
Firstpage
33
Lastpage
40
Abstract
This paper presents a cooperative heterogeneous computing framework which enables the efficient utilization of available computing resources of host CPU cores for CUDA kernels, which are designed to run only on GPU. The proposed system exploits at runtime the coarse-grain thread-level parallelism across CPU and GPU, without any source recompilation. To this end, three features including a work distribution module, a transparent memory space, and a global scheduling queue are described in this paper. With a completely automatic runtime workload distribution, the proposed framework achieves speedups as high as 3.08 compared to the baseline GPU-only processing.
Keywords
graphics processing units; multi-threading; parallel architectures; processor scheduling; CPU core; CPU/GPU hybrids; CUDA kernel; GPU-only processing; coarse-grain thread-level parallelism; computing resources; cooperative heterogeneous computing; global scheduling queue; parallel processing; runtime workload distribution; transparent memory space; work distribution module; Central Processing Unit; Graphics processing unit; Instruction sets; Kernel; Multicore processing; Parallel processing; Runtime;
fLanguage
English
Publisher
ieee
Conference_Titel
Interaction between Compilers and Computer Architectures (INTERACT), 2012 16th Workshop on
Conference_Location
New Orleans, LA
ISSN
1550-6207
Print_ISBN
978-1-4673-2613-1
Type
conf
DOI
10.1109/INTERACT.2012.6339624
Filename
6339624
Link To Document