• DocumentCode
    2785920
  • Title

    Automatic Sliding Window Operation Optimization for FPGA-Based

  • Author

    Yu, Haiqian ; Leeser, Miriam

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Northeastern Univ., Boston, MA
  • fYear
    2006
  • fDate
    24-26 April 2006
  • Firstpage
    76
  • Lastpage
    88
  • Abstract
    FPGA-based computing boards are frequently used as hardware accelerators for image processing algorithms based on sliding window operations (SWOs). SWOs are both computationally intensive and data intensive and benefit from hardware acceleration with FPGAs, especially for delay sensitive applications. The current design process requires that, for each specific application using SWOs with different size of window, image, etc.; a detail design must be completed before a realistic estimate of the achievable speedup can be obtained. We present an automated tool, sliding window operation optimization (SWOOP), that generates the estimate of speedup for a high performance design before detailed implementation is complete. The achievable speedup is determined by the area of the FPGA, or, more often, the memory bandwidth to the processing elements. The memory bandwidth to each processing element is a combination of bandwidth to the FPGA and the efficient use of on-chip RAM as a data cache. SWOOP uses analytic techniques to automatically determine the number of parallel processing elements to implement on the FPGA, the assignment of input and output data to on-board memory, and the organization of data in on-chip memory to most effectively keep the processing elements busy. The result is a block layout of the final design, its memory architecture, and a measure of the achievable speedup. The results, compared to manual designs, show that the estimates obtained usinq SWOOP are very accurate
  • Keywords
    field programmable gate arrays; memory architecture; parallel processing; random-access storage; computing boards; data cache; field programmable gate arrays; hardware accelerators; image processing; memory architecture; memory bandwidth; on-chip RAM; parallel processing; sliding window operation optimization; Acceleration; Bandwidth; Delay; Design optimization; Field programmable gate arrays; Hardware; Image processing; Process design; Random access memory; Read-write memory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Field-Programmable Custom Computing Machines, 2006. FCCM '06. 14th Annual IEEE Symposium on
  • Conference_Location
    Napa, CA
  • Print_ISBN
    0-7695-2661-6
  • Type

    conf

  • DOI
    10.1109/FCCM.2006.29
  • Filename
    4020897