• DocumentCode
    3299445
  • Title

    Automatically Tuned Linear Algebra Software

  • Author

    Whaley, R. Clint ; Dongarra, Jack J.

  • fYear
    1998
  • fDate
    07-13 Nov. 1998
  • Firstpage
    38
  • Lastpage
    38
  • Abstract
    This paper describes an approach for the automatic generation and optimization of numerical software for processors with deep memory hierarchies and pipelined functional units. The production of such software for machines ranging from desktop workstations to embedded processors can be a tedious and time-consuming process. The work described here can help in automating much of this process. We will concentrate our efforts on the widely used linear algebra kernels called the Basic Linear Algebra Subroutines (BLAS). In particular, the work presented here is for general matrix multiply, DGEMM. However, much of the technology and approach developed here can be applied to the other Level 3 BLAS, and the general strategy can have an impact on basic linear algebra operations in general and may be extended to other important kernel operations.
  • Keywords
    BLAS; code; high performance; linear algebra; optimization; tuning; Algorithms; Buildings; Embedded software; Kernel; Lifting equipment; Linear algebra; Parallel processing; Production; Timing; Workstations
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Supercomputing, 1998. SC98. IEEE/ACM Conference on
  • Print_ISBN
    0-8186-8707-X
  • Type

    conf

  • DOI
    10.1109/SC.1998.10004
  • Filename
    1437325