DocumentCode :
1986047
Title :
Performance and area modeling of complete FPGA designs in the presence of loop transformations
Author :
Shayee, K. R Shesha ; Park, Joonseok ; Diniz, Pedro C.
Author_Institution :
Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
fYear :
2003
fDate :
9-11 April 2003
Firstpage :
296
Abstract :
Digital image processing algorithms are a good match for direct implementation on FPGAs as current FPGA architectures can naturally match the fine grain parallelism in these applications. Typically, these algorithms are structured as a sequence of operations, expressed in high-level programming languages as tight loop nests. The loops usually define a shifting-window region over which the algorithm applies a simple localized operator (e.g., a differential gradient, or a min/max). In this research we focus on the development of fast, yet accurate performance and area modeling of complete FPGA designs that combine analytical, empirical and behavioral estimation techniques. We model the application of a set of important program transformations for image processing algorithms, namely loop unrolling, tiling, loop interchanging, loop fission and array privatization, and explore pipelined and non-pipelined execution modes. We take into consideration the impact of various transformations, in the presence of limited I/O resources like address generators and external memory data channels, on the performance of a complete design implemented in a FPGA based architecture.
Keywords :
field programmable gate arrays; gradient methods; image processing; input-output programs; logic design; performance evaluation; program control structures; reconfigurable architectures; FPGA architecture; FPGA design; I/O resource; address generator; analytical estimation; area modeling; array privatization; behavioral estimation; differential gradient; digital image processing algorithm; empirical estimation; external memory data channel; fine grain parallelism; high-level programming language; localized operator; loop transformation; operation sequence; performance modeling; program transformation; sequence structured; shifting-window region; tight loop nest; Bandwidth; Computer languages; Delay; Field programmable gate arrays; Hardware; Image processing; Information analysis; Parallel processing; Performance analysis; Read-write memory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Field-Programmable Custom Computing Machines, 2003. FCCM 2003. 11th Annual IEEE Symposium on
Print_ISBN :
0-7695-1979-2
Type :
conf
DOI :
10.1109/FPGA.2003.1227278
Filename :
1227278
Link To Document :
بازگشت