Title :
K-loops: Loop skewing for Reconfigurable Architectures
Author :
Dragomir, Ozana Silvia ; Bertels, Koen
Author_Institution :
Comput. Eng., Tech. Univ. Delft, Delft, Netherlands
Abstract :
In this paper, we propose new techniques for improving the performance of applications running on a reconfigurable platform that supports the Molen programming paradigm. We focus on parallelizing loops with wavefront-like dependencies that contain hardware-mapped kernels in the loop body (called K-loops). For this purpose, we use traditional transformations: loop skewing to eliminate the dependencies and loop unrolling for parallelization. The first technique improves application performance by running multiple instances of the kernel in parallel on the reconfigurable hardware. The second technique extends the first: it determines how many kernel instances should be scheduled for software execution in each iteration, concurrently with the hardware execution, such that the hardware and software execution times are balanced. In the experimental part, we present results for parallelizing the deblocking filter (DF), which is part of the H.264 encoder and decoder, after skewing the main DF loop to eliminate the data dependencies. For an unroll factor of 8, we report a loop speedup of up to 4.78.
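The combination of loop skewing and unrolling described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes a two-dimensional loop nest in which iteration (i, j) depends on its top neighbour (i-1, j) and left neighbour (i, j-1), which is one common form of wavefront dependency. The names `wavefronts`, `run_skewed`, `kernel`, and `unroll` are hypothetical and chosen only for illustration. Iterations are grouped by the skewed index t = i + j, so that all iterations in one group are mutually independent and can be issued in batches of at most `unroll` kernel instances.

```python
def wavefronts(rows, cols):
    """Group iterations (i, j) of a rows x cols loop nest by skewed index t = i + j.

    Assumption: (i, j) depends only on (i-1, j) and (i, j-1), so every
    iteration in one wavefront is independent of the others in it.
    """
    fronts = []
    for t in range(rows + cols - 1):
        i_lo = max(0, t - cols + 1)   # keep j = t - i below cols
        i_hi = min(rows - 1, t)       # keep j = t - i non-negative
        fronts.append([(i, t - i) for i in range(i_lo, i_hi + 1)])
    return fronts


def run_skewed(rows, cols, kernel, unroll):
    """Run `kernel` over the skewed loop in batches of up to `unroll` iterations.

    Each batch stands for kernel instances that could execute in parallel;
    this sketch runs them one after another and returns the batches in order.
    """
    batches = []
    for front in wavefronts(rows, cols):
        for start in range(0, len(front), unroll):
            batch = front[start:start + unroll]
            for i, j in batch:
                kernel(i, j)
            batches.append(batch)
    return batches


if __name__ == "__main__":
    for batch in run_skewed(3, 4, lambda i, j: None, unroll=2):
        print(batch)
```

A wavefront shorter than `unroll` leaves some parallel slots idle in this sketch, which is why the achievable speedup stays below the unroll factor near the corners of the iteration space.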
Keywords :
parallel processing; reconfigurable architectures; H.264 encoder; K-loops; Molen programming paradigm; deblocking filter; hardware-mapped kernels; loop skewing; wavefront-like dependencies; Acceleration; Application software; Coprocessors; Filters; Hardware; Image processing; Iterative decoding; Kernel; Scheduling
Conference_Title :
Field-Programmable Technology, 2009. FPT 2009. International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4244-4375-8
Electronic_ISBN :
978-1-4244-4377-2
DOI :
10.1109/FPT.2009.5377656