Title :
OpenCL computing on FPGA using multiported
Author :
Shared Memory;Tahsin Turker Mutlugun; Sheng-De Wang
Author_Institution :
Dept of Electrical Engineering, National Taiwan University, Taipei 106, Taiwan
Abstract :
This paper focuses on memory access improvements for the OpenCL architecture for FPGAs with the goal of achieving trade-off between performance and required resources. In OpenCL compute units, there is usually a linear relation between computation time and local memory access latency. This latency is normally hidden by increasing the parallel workload. However, with such an approach, the target FPGA device could easily run out of resources. In this work, conflict-free multiported memories are used to minimize local memory access latency. Experiments show that multiported memories can successfully increase computation speed and reduce the required parallel workload for maximum throughput to practical amounts.
Keywords :
"Random access memory","Ports (Computers)","Field programmable gate arrays","Memory management","Kernel","Clocks","Prefetching"
Conference_Titel :
Field Programmable Logic and Applications (FPL), 2015 25th International Conference on
DOI :
10.1109/FPL.2015.7293983