• DocumentCode
    2806368
  • Title

    High Performance Matrix Inversion on a Multi-core Platform with Several GPUs

  • Author

    Ezzatti, Pablo ; Quintana-Ortí, Enrique S. ; Remón, Alfredo

  • Author_Institution
    Centro de Calculo, Univ. de la Republica, Montevideo, Uruguay
  • fYear
    2011
  • fDate
    9-11 Feb. 2011
  • Firstpage
    87
  • Lastpage
    93
  • Abstract
    Inversion of large-scale matrices appears in a few scientific applications like model reduction or optimal control. Matrix inversion requires an important computational effort and, therefore, the application of high performance computing techniques and architectures for matrices with dimension in the order of thousands. Following the recent uprise of graphics processors (GPUs), we present and evaluate high performance codes for matrix inversion, based on Gauss-Jordan elimination with partial pivoting, which off-load the main computational kernels to one or more GPUs while performing fine-grain operations on the general-purpose processor. The target architecture consists of a multi-core processor connected to several GPUs. Parallelism is extracted from parallel implementations of BLAS and from the concurrent execution of operations in the available computational units. Numerical experiments on a system with two Intel QuadCore processors and four NVIDIA cl060 GPUs illustrate the efficiency and the scalability of the different implementations, which deliver over 1.2 x 1012 floating point operations per second.
  • Keywords
    computer graphic equipment; coprocessors; matrix algebra; multiprocessing systems; GPU; Gauss-Jordan elimination; Intel QuadCore processors; floating point operations; graphics processors; matrix inversion; multicore platform; multicore processor; optimal control; parallel implementations; Electronic mail; Graphics processing unit; Multicore processing; Parallel processing; Partitioning algorithms; GPUs; linear algebra; matrix inversion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel, Distributed and Network-Based Processing (PDP), 2011 19th Euromicro International Conference on
  • Conference_Location
    Ayia Napa
  • ISSN
    1066-6192
  • Print_ISBN
    978-1-4244-9682-2
  • Type

    conf

  • DOI
    10.1109/PDP.2011.66
  • Filename
    5738989