DocumentCode :
2061475
Title :
High performance implementation of tridiagonalization on the SR8000
Author :
Naono, Ken ; Yamamoto, Yiisaku ; Igai, Mitsuyoshi ; Hirayama, Hiroyuki
Author_Institution :
Central Res. Lab., Hitachi Ltd., Tokyo, Japan
Volume :
1
fYear :
2000
fDate :
14-17 May 2000
Firstpage :
206
Abstract :
The methods of high performance tridiagonalization on the HITACHI SR8000 are described and evaluated. To achieve high performance, we adopted the blocked tridiagonalization and the scattered square decomposition. In addition, to achieve more performance in one node, we took the ways of the rectangular computation in the diagonal blocks and the loop integration for reducing the number of read/write operations. On one node of the SR8000, we achieved about 4.0 Gflop/s in the 4000-dimension tridiagonalization of a real symmetric matrix. This is much better than the 2.9 Gflop/s of our matrix library´s on the HITACHI S-3800, which has the same peak performance with one node of the SR8000.
Keywords :
matrix decomposition; parallel algorithms; parallel machines; performance evaluation; 4.0 Gflop/s; HITACHI SR8000; blocked tridiagonalization; real symmetric matrix; scattered square decomposition; tridiagonalization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing in the Asia-Pacific Region, 2000. Proceedings. The Fourth International Conference/Exhibition on
Conference_Location :
Beijing, China
Print_ISBN :
0-7695-0589-2
Type :
conf
DOI :
10.1109/HPC.2000.846547
Filename :
846547
Link To Document :
بازگشت