• DocumentCode
    3318392
  • Title

    Area and time efficient implementations of matrix multiplication on FPGAs

  • Author

    Jang, Ju-wook ; Choi, Seonil ; Prasanna, Viktor K.

  • Author_Institution
    Electron. Eng., Sogang Univ., Seoul, South Korea
  • fYear
    2002
  • fDate
    16-18 Dec. 2002
  • Firstpage
    93
  • Lastpage
    100
  • Abstract
    We develop new algorithms and architectures for matrix multiplication on configurable hardware. These designs significantly reduce the latency as well as the area. Our designs improve the previous designs in terms of the area/speed metric where the speed denotes the maximum achievable running frequency. The area/speed metrics for the previous designs and our design are 14.45, 4.93, and 2.35, respectively, for 4 × 4 matrix multiplication. The latency of one of the previous design is 0.57 μs, while our design takes 0.15 μs using 18% less area. The area of our designs is smaller by 11% - 46% compared with the best known systolic designs with the same latency for the matrices of sizes 3 × 3 - 12 × 12. The performance improvements tend to grow with the problem size.
  • Keywords
    field programmable gate arrays; matrix multiplication; FPGA; architecture design; area/speed metric; configurable hardware; latency; matrix multiplication algorithm; Delay; Field programmable gate arrays; Frequency; Graphics; Hardware; Image processing; Kernel; Robots; Signal processing; Signal processing algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Field-Programmable Technology, 2002. (FPT). Proceedings. 2002 IEEE International Conference on
  • Print_ISBN
    0-7803-7574-2
  • Type

    conf

  • DOI
    10.1109/FPT.2002.1188669
  • Filename
    1188669