• DocumentCode
    1514431
  • Title

    Accelerating Matrix Operations with Improved Deeply Pipelined Vector Reduction

  • Author

    Tai, Yi-Gang ; Lo, Chia-Tien Dan ; Psarris, Kleanthis

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Texas at San Antonio, San Antonio, TX, USA
  • Volume
    23
  • Issue
    2
  • fYear
    2012
  • Firstpage
    202
  • Lastpage
    210
  • Abstract
    Many scientific or engineering applications involve matrix operations, in which reduction of vectors is a common operation. If the core operator of the reduction is deeply pipelined, which is usually the case, dependencies between the input data elements cause data hazards. To tackle this problem, we propose a new reduction method with low latency and high pipeline utilization. The performance of the proposed design is evaluated for both single data set and multiple data set scenarios. Further, QR decomposition is used to demonstrate how the proposed method can accelerate its execution. We implement the design on an FPGA and compare its results to other methods.
  • Keywords
    digital arithmetic; field programmable gate arrays; matrix algebra; FPGA; QR decomposition; improved deeply pipelined vector reduction; matrix operations; Adders; Clocks; Hardware; Merging; Pipeline processing; Pipelines; Registers; Reconfigurable hardware; algorithm design and analysis.; parallel algorithms; parallel and vector implementations; pipeline processors;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/TPDS.2011.141
  • Filename
    5765947