DocumentCode :
3691882
Title :
The Approximate Discrete Radon Transform: A Case Study in Auto-Tuning of OpenCL Implementations
Author :
Bücker;Ralf Seidler; Neuhäuser;Tobias Beier
Author_Institution :
Carl Zeiss Jena GmbH, Jena, Germany
fYear :
2015
Firstpage :
219
Lastpage :
226
Abstract :
The Open Computing Language (OpenCL) is designed to provide a platform-independent specification for programming heterogenous computing systems. The performance of an OpenCL program, however, is not easily transferrable from one platform to another. Auto-tuning is among the techniques that address this situation by automating the performance optimization of OpenCL programs via systematically applying program transformations. We introduce a novel auto-tuning framework to generate OpenCL programs and report on a case study computing an approximate discrete Radon transform. Experiments on four different graphics processing units indicate that, for a wide range of problem sizes and input parameters, the execution times of the auto-tuned OpenCL programs are smaller than those of three hand-tuned CUDA implementations.
Keywords :
"Kernel","Radon","Transforms","Graphics processing units","Runtime","Arrays"
Publisher :
ieee
Conference_Titel :
Embedded Multicore/Many-core Systems-on-Chip (MCSoC), 2015 IEEE 9th International Symposium on
Type :
conf
DOI :
10.1109/MCSoC.2015.38
Filename :
7328208
Link To Document :
بازگشت