Title :
Automatic Sliding Window Operation Optimization for FPGA-Based
Author :
Yu, Haiqian ; Leeser, Miriam
Author_Institution :
Dept. of Electr. & Comput. Eng., Northeastern Univ., Boston, MA
Abstract :
FPGA-based computing boards are frequently used as hardware accelerators for image processing algorithms based on sliding window operations (SWOs). SWOs are both computationally intensive and data intensive and benefit from hardware acceleration with FPGAs, especially for delay sensitive applications. The current design process requires that, for each specific application using SWOs with different size of window, image, etc.; a detail design must be completed before a realistic estimate of the achievable speedup can be obtained. We present an automated tool, sliding window operation optimization (SWOOP), that generates the estimate of speedup for a high performance design before detailed implementation is complete. The achievable speedup is determined by the area of the FPGA, or, more often, the memory bandwidth to the processing elements. The memory bandwidth to each processing element is a combination of bandwidth to the FPGA and the efficient use of on-chip RAM as a data cache. SWOOP uses analytic techniques to automatically determine the number of parallel processing elements to implement on the FPGA, the assignment of input and output data to on-board memory, and the organization of data in on-chip memory to most effectively keep the processing elements busy. The result is a block layout of the final design, its memory architecture, and a measure of the achievable speedup. The results, compared to manual designs, show that the estimates obtained usinq SWOOP are very accurate
Keywords :
field programmable gate arrays; memory architecture; parallel processing; random-access storage; computing boards; data cache; field programmable gate arrays; hardware accelerators; image processing; memory architecture; memory bandwidth; on-chip RAM; parallel processing; sliding window operation optimization; Acceleration; Bandwidth; Delay; Design optimization; Field programmable gate arrays; Hardware; Image processing; Process design; Random access memory; Read-write memory;
Conference_Titel :
Field-Programmable Custom Computing Machines, 2006. FCCM '06. 14th Annual IEEE Symposium on
Conference_Location :
Napa, CA
Print_ISBN :
0-7695-2661-6
DOI :
10.1109/FCCM.2006.29