Title :
Caching-efficient multithreaded fast multiplication of sparse matrices
Author :
Sulatycke, Peter D. ; Ghose, Kanad
Author_Institution :
Dept. of Comput. Sci., State Univ. of New York, Binghamton, NY, USA
fDate :
30 Mar-3 Apr 1998
Abstract :
Several fast sequential algorithms have been proposed in the past to multiply sparse matrices. These algorithms do not explicitly address the impact of caching on performance. We show that a rather simple sequential cache-efficient algorithm provides significantly better performance than existing algorithms for sparse matrix multiplication. We then describe a multithreaded implementation of this simple algorithm and show that its performance scales well with the number of threads and CPUs. For 10% sparse, 500×500 matrices, the multithreaded version running on 4-CPU systems provides more than a 41.1-fold speed increase over the well-known BLAS routine and a 14.6 fold and 44.6-fold speed increase over two other recent techniques for fast sparse matrix multiplication, both of which are relatively difficult to parallelize efficiently
Keywords :
cache storage; mathematics computing; matrix multiplication; multiprogramming; parallel algorithms; software performance evaluation; sparse matrices; BLAS routine; CPU; caching; loop interchanging; multithreaded implementation; performance; sequential algorithms; sparse matrix multiplication; speed increase; Algorithm design and analysis; Computational fluid dynamics; Computer science; Data structures; Delay; Linear systems; Matrix converters; Sparse matrices; Symmetric matrices; Yarn;
Conference_Titel :
Parallel Processing Symposium, 1998. IPPS/SPDP 1998. Proceedings of the First Merged International ... and Symposium on Parallel and Distributed Processing 1998
Conference_Location :
Orlando, FL
Print_ISBN :
0-8186-8404-6
DOI :
10.1109/IPPS.1998.669899