• DocumentCode
    229178
  • Title

    Co-exploration of NLA kernels and specification of Compute Elements in distributed memory CGRAs

  • Author

    Mahadurkar, Mahesh ; Merchant, Farhad ; Maity, Avisek ; Vatwani, Kapil ; Munje, Ishan ; Gopalan, Nakul ; Nandy, S.K. ; Narayan, Rohit

  • Author_Institution
    CADLab., Indian Inst. of Sci., Bangalore, India
  • fYear
    2014
  • fDate
    14-17 July 2014
  • Firstpage
    225
  • Lastpage
    232
  • Abstract
    Coarse Grained Reconfigurable Architectures (CGRA) are emerging as embedded application processing units in computing platforms for Exascale computing. Such CGRAs are distributed memory multi-core compute elements on a chip that communicate over a Network-on-chip (NoC). Numerical Linear Algebra (NLA) kernels are key to several high performance computing applications. In this paper we propose a systematic methodology to obtain the specification of Compute Elements (CE) for such CGRAs. We analyze block Matrix Multiplication and block LU Decomposition algorithms in the context of a CGRA, and obtain theoretical bounds on communication requirements, and memory sizes for a CE. Support for high performance custom computations common to NLA kernels are met through custom function units (CFUs) in the CEs. We present results to justify the merits of such CFUs.
  • Keywords
    distributed memory systems; matrix multiplication; network-on-chip; parallel processing; reconfigurable architectures; CE; CFU; NLA kernel co-exploration; NoC; block LU decomposition algorithm; block matrix multiplication; coarse grained reconfigurable architectures; compute element specification; custom function units; distributed memory CGRA; distributed memory multicore compute elements; embedded application processing units; exascale computing; high performance computing applications; high performance custom computations; network-on-chip; numerical linear algebra kernel; Algorithm design and analysis; Computational modeling; Kernel; Matrix decomposition; Memory management; Registers; CGRA; computation; numerical linear algebra; parallelism;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XIV), 2014 International Conference on
  • Conference_Location
    Agios Konstantinos
  • Type

    conf

  • DOI
    10.1109/SAMOS.2014.6893215
  • Filename
    6893215