• DocumentCode
    2974710
  • Title

    Efficient methods for out-of-order load/store execution for high-performance soft processors

  • Author

    Wong, Hang ; Betz, Vaughn ; Rose, J.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Toronto, Toronto, ON, Canada
  • fYear
    2013
  • fDate
    9-11 Dec. 2013
  • Firstpage
    442
  • Lastpage
    445
  • Abstract
    As FPGAs continue to increase in size, it becomes increasingly feasible and desirable to build higher performance soft processors. Preserving the familiar single-threaded programming model can be done with an out of order processor. The ability to execute memory loads and stores out of order has a large impact on performance, but this is difficult to do because the dependencies between stores and loads are not known until addresses are computed. Out of order memory disambiguation is traditionally done with CAMs in the load queue and store queue, but large CAMs are inefficient on FPGAs. Store Queue Index Prediction (SQIP) and NoSQ propose to replace CAMs with store-load forwarding prediction and load re-execution. We implement four memory disambiguation schemes (in-order, CAM, SQIP, NoSQ) on a Stratix IV FPGA and evaluate the area and delay trade-offs. We find that CAM area and delay degrade quickly with load/store queue size, while SQIP and NoSQ have little degradation with queue size but have area overhead for prediction and predictor training hardware. SQIP and NoSQ use less area than CAMs beyond 32 and 16 load/store queue entries, respectively, and have higher maximum frequency beyond 4 entries.
  • Keywords
    content-addressable storage; field programmable gate arrays; multiprocessing systems; CAM; NoSQ; SQIP; Stratix IV FPGA; content addressable memory; high-performance soft processors; load-store queue size; memory disambiguation schemes; out-of-order load-store execution; predictor training hardware; store queue index prediction; Cams; Computer aided manufacturing; Field programmable gate arrays; Hardware; Out of order; Random access memory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Field-Programmable Technology (FPT), 2013 International Conference on
  • Conference_Location
    Kyoto
  • Print_ISBN
    978-1-4799-2199-7
  • Type

    conf

  • DOI
    10.1109/FPT.2013.6718409
  • Filename
    6718409