• DocumentCode
    1997141
  • Title

    Tridiagonalization of a Symmetric Dense Matrix on a GPU Cluster

  • Author

    Yamazaki, Ichitaro ; Tingxing Dong ; Tomov, Stanimire ; Dongarra, Jack

  • Author_Institution
    Electr. Eng. & Comput. Sci. Dept., Univ. of Tennessee, Knoxville, TN, USA
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    1070
  • Lastpage
    1079
  • Abstract
    Symmetric dense Eigen value problems arise in many scientific and engineering simulations. In this paper, we use GPUs to accelerate its main computational kernel, the tridiagonalization of a dense symmetric matrix on a distributed multicore architecture. We then study the performance of this hybrid message-passing/shared-memory/GPU-computing paradigm on up to 16 compute nodes, each of which consists of 16 Intel Sandy Bridge processors and three NVIDIA GPUs. These studies show that such a hybrid paradigm can exploit the underlying hardware architecture and obtain significant speedups over a flat message-passing paradigm can, and they demonstrate a potential of efficiently solving large-scale Eigen value problems on a GPU cluster. Furthermore, these studies may provide insights on the general effects of such hybrid paradigms on emerging high-performance computers.
  • Keywords
    eigenvalues and eigenfunctions; graphics processing units; mathematics computing; matrix algebra; message passing; parallel architectures; shared memory systems; Intel SandyBridge processors; NVIDIA GPU cluster; computational kernel; distributed multicore architecture; hardware architecture; high-performance computers; hybrid message passing-shared-memory-GPU-computing paradigm; large-scale eigenvalue problems; symmetric dense-eigenvalue problems; symmetric dense-matrix tridiagonalization process; Computer architecture; Graphics processing units; Handheld computers; Kernel; Layout; Symmetric matrices; Vectors; GPU cluster; dense symmetric tridiagonalization; distributed multicores; hybrid programming;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
  • Conference_Location
    Cambridge, MA
  • Print_ISBN
    978-0-7695-4979-8
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2013.265
  • Filename
    6650992