DocumentCode
3048958
Title
K-loops: Loop skewing for Reconfigurable Architectures
Author
Dragomir, Ozana Silvia ; Bertels, Koen
Author_Institution
Comput. Eng., Tech. Univ. Delft, Delft, Netherlands
fYear
2009
fDate
9-11 Dec. 2009
Firstpage
199
Lastpage
206
Abstract
In this paper, we propose new techniques for improving the performance of applications running on a reconfigurable platform that supports the Molen programming paradigm. We focus on parallelizing loops that contain hardware-mapped kernels in the loop body (called K-loops) and have wavefront-like dependencies. For this purpose, we use traditional transformations: loop skewing to eliminate the dependencies and loop unrolling to enable parallelization. The first technique improves application performance by running multiple instances of the kernel in parallel on the reconfigurable hardware. The second technique extends the first and determines how many kernel instances should be scheduled for software execution in each iteration, concurrently with the hardware execution, such that the hardware and software execution times are balanced. In the experimental part, we present results for parallelizing the deblocking filter (DF), which is part of the H.264 encoder and decoder, after skewing the main DF loop to eliminate the data dependencies. For an unroll factor of 8, we report a loop speedup of up to 4.78.
Keywords
parallel processing; reconfigurable architectures; H.264 encoder; K-loops; Molen programming paradigm; deblocking filter; hardware-mapped kernels; loop skewing; wavefront-like dependencies; Acceleration; Application software; Coprocessors; Filters; Hardware; Image processing; Iterative decoding; Kernel; Scheduling
fLanguage
English
Publisher
IEEE
Conference_Titel
Field-Programmable Technology, 2009. FPT 2009. International Conference on
Conference_Location
Sydney, NSW
Print_ISBN
978-1-4244-4375-8
Electronic_ISBN
978-1-4244-4377-2
Type
conf
DOI
10.1109/FPT.2009.5377656
Filename
5377656