DocumentCode :
167596
Title :
Design and Implementation of a Large Scale Tree-Based QR Decomposition Using a 3D Virtual Systolic Array and a Lightweight Runtime
Author :
Yamazaki, Ichitaro ; Kurzak, Jakub ; Luszczek, Piotr ; Dongarra, Jack
Author_Institution :
Univ. of Tennessee, Knoxville, TN, USA
fYear :
2014
fDate :
19-23 May 2014
Firstpage :
1495
Lastpage :
1504
Abstract :
A systolic array provides an alternative computing paradigm to the von Neumann architecture. Though its hardware implementation has failed as a paradigm for designing integrated circuits in the past, we are now discovering that the systolic array as a software virtualization layer can lead to an extremely scalable execution paradigm. To demonstrate this scalability, in this paper, we design and implement a 3D virtual systolic array to compute a tile QR decomposition of a tall-and-skinny dense matrix. Our implementation is based on a state-of-the-art algorithm that factorizes a panel based on a tree-reduction. Using a runtime developed as a part of the Parallel Ultra Light Systolic Array Runtime (PULSAR) project, we demonstrate on a Cray-XT5 machine how our virtual systolic array can be mapped to a large-scale machine and obtain excellent parallel performance. This is an important contribution since such a QR decomposition is used, for example, to compute a least squares solution of an overdetermined system, which arises in many scientific and engineering problems.
Keywords :
matrix algebra; systolic arrays; trees (mathematics); 3D virtual systolic array; Cray-XT5 machine; PULSAR project; large scale tree-based QR decomposition; least squares solution; lightweight runtime; parallel performance; software virtualization layer; tall-and-skinny dense matrix; tile QR decomposition; tree-reduction; von Neumann architecture; Arrays; Computers; Hardware; Message systems; Runtime; Software; Three-dimensional displays; QR decomposition; Systolic array; dataflow; message-passing; multithreading; runtime;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Parallel & Distributed Processing Symposium Workshops (IPDPSW), 2014 IEEE International
Conference_Location :
Phoenix, AZ
Print_ISBN :
978-1-4799-4117-9
Type :
conf
DOI :
10.1109/IPDPSW.2014.167
Filename :
6969553