• DocumentCode
    3585584
  • Title

    Approaching overhead-free execution on FPGA soft-processors

  • Author

    LaForest, Charles Eric ; Anderson, Jason ; Steffan, J. Gregory

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Toronto, Toronto, ON, Canada
  • fYear
    2014
  • Firstpage
    99
  • Lastpage
    106
  • Abstract
    Implementing systems on FPGA soft-processors, rather than as custom hardware, eases and accelerates the development process, but at the cost of a great reduction in performance. Orthogonal to limitations in parallelism or clock frequency, this reduction in performance primarily originates in the intrinsic addressing and flow-control overheads of scalar microprocessors, which expend a considerable number of cycles interleaving address calculations and branch decisions within the actual useful work. We present an improved FPGA soft-processor architecture which statically overlaps "overhead" computations and executes them in parallel with the "useful" computations, significantly reducing the number of processor cycles needed to execute sequential programs, while reducing maximum clock frequency to 0.939x of its original value. In addition to eliminating almost all overhead computations, the proposed soft-processor can operate at 500 MHz on the Altera Stratix IV FPGA - 0.909x of the absolute maximum rating. Combined, the high speed and execution efficiency increase the range of FPGA designs amenable to soft-processors rather than custom hardware. We evaluate our cycle count improvements with multiple benchmarks, achieving speedups ranging from 1.07x for control-heavy code, to 1.92x for looping code, never performing worse than the original sequential code, and always performing better than a totally unrolled loop.
  • Keywords
    field programmable gate arrays; microprocessor chips; Altera Stratix IV FPGA - 0.909x; FPGA soft-processor architecture; clock frequency; flow-control overheads; frequency 500 MHz; overhead-free execution; scalar microprocessors; Field programmable gate arrays; Hardware; Instruction sets; Pipeline processing; Pipelines; Radiation detectors; Random access memory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Field-Programmable Technology (FPT), 2014 International Conference on
  • Print_ISBN
    978-1-4799-6244-0
  • Type

    conf

  • DOI
    10.1109/FPT.2014.7082760
  • Filename
    7082760