Tridiagonalization of a Symmetric Dense Matrix on a GPU Cluster

Author

Yamazaki, Ichitaro ; Tingxing Dong ; Tomov, Stanimire ; Dongarra, Jack

Author_Institution

Electr. Eng. & Comput. Sci. Dept., Univ. of Tennessee, Knoxville, TN, USA

fYear

2013

fDate

20-24 May 2013

Firstpage

1070

Lastpage

1079

Abstract

Symmetric dense Eigen value problems arise in many scientific and engineering simulations. In this paper, we use GPUs to accelerate its main computational kernel, the tridiagonalization of a dense symmetric matrix on a distributed multicore architecture. We then study the performance of this hybrid message-passing/shared-memory/GPU-computing paradigm on up to 16 compute nodes, each of which consists of 16 Intel Sandy Bridge processors and three NVIDIA GPUs. These studies show that such a hybrid paradigm can exploit the underlying hardware architecture and obtain significant speedups over a flat message-passing paradigm can, and they demonstrate a potential of efficiently solving large-scale Eigen value problems on a GPU cluster. Furthermore, these studies may provide insights on the general effects of such hybrid paradigms on emerging high-performance computers.

Keywords

eigenvalues and eigenfunctions; graphics processing units; mathematics computing; matrix algebra; message passing; parallel architectures; shared memory systems; Intel SandyBridge processors; NVIDIA GPU cluster; computational kernel; distributed multicore architecture; hardware architecture; high-performance computers; hybrid message passing-shared-memory-GPU-computing paradigm; large-scale eigenvalue problems; symmetric dense-eigenvalue problems; symmetric dense-matrix tridiagonalization process; Computer architecture; Graphics processing units; Handheld computers; Kernel; Layout; Symmetric matrices; Vectors; GPU cluster; dense symmetric tridiagonalization; distributed multicores; hybrid programming;

fLanguage

English

Publisher

ieee

Conference_Titel

Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International

Conference_Location

Cambridge, MA

Print_ISBN

978-0-7695-4979-8

Type

conf

DOI

10.1109/IPDPSW.2013.265

Filename

6650992