DocumentCode :
3757983
Title :
Extensions over OpenCL for Latency Reduction and Critical Applications
Author :
Grigore Lupescu;Emil-Ioan Slusanschi;Nicolae Tapus
Author_Institution :
Comput. Sci. &
fYear :
2015
Firstpage :
379
Lastpage :
385
Abstract :
Hardware and software stack complexity make programming GPGPUs difficult and limit application portability. This article first discusses challenges imposed by the current hardware and software model in GPGPU systems which relies heavily on the HOST device (CPU). We then identify system bottlenecks both in the hardware design and in the software stack and present two ideas to extend the HOST and DEVICE side of the OpenCL API with the aim to improve latency and device safety. As a first goal we target HOST side latency reduction using user synchronization directives. Our second goal was to improve on DEVICE side latency and add safety through a software layer which manages kernel execution. For both HOST and DEVICE side latency reduction we present concrete performance results.
Keywords :
"Kernel","Hardware","Computer architecture","Runtime","Program processors","Message systems"
Publisher :
ieee
Conference_Titel :
Symbolic and Numeric Algorithms for Scientific Computing (SYNASC), 2015 17th International Symposium on
Type :
conf
DOI :
10.1109/SYNASC.2015.64
Filename :
7426108
Link To Document :
بازگشت