DocumentCode
1556564
Title
A Novel Parallel Scan for Multicore Processors and Its Application in Sparse Matrix-Vector Multiplication
Author
Zhang, Nan
Author_Institution
Dept. of Comput. Sci. & Software Eng., Xi´´an Jiaotong-Liverpool Univ., Suzhou, China
Volume
23
Issue
3
fYear
2012
fDate
3/1/2012 12:00:00 AM
Firstpage
397
Lastpage
404
Abstract
We present a novel parallel algorithm for computing the scan operations on x86 multicore processors. The existing best known parallel scan for the same platform requires the number of processors to be a power of two. But this constraint is removed from our proposed method. In the design of the algorithm architectural considerations for x86 multicore processors are given so that the rate of cache misses is reduced and the cost of thread synchronization and management is minimized. Results from tests made on a machine with dual-socket times quad-core Intel Xeon E5405 showed that the proposed solution outperformed the best known parallel reference. A novel approach to sparse matrix-vector multiplication (SpMV) based on the proposed scan is then explained. The approach, unlike the existing ones that make use of backward segmented operations, uses forward ones for more efficient caching. An implementation of the proposed SpMV was tested against the SpMV in Intel´s Math Kernel Library (MKL) and merits were found in the proposed approach.
Keywords
cache storage; matrix multiplication; multiprocessing systems; parallel algorithms; vectors; Intel Math Kernel Library; cache miss reduction; dual-socket -times quad-core Intel Xeon E5405; management cost minimization; parallel algorithm; parallel scan; sparse matrix-vector multiplication; thread synchronization cost minimization; x86 multicore processors; Algorithm design and analysis; Arrays; Instruction sets; Multicore processing; Software algorithms; Sparse matrices; Parallel algorithms; multicore computing; parallel scan; prefix sum; sparse matrix-vector multiplication.;
fLanguage
English
Journal_Title
Parallel and Distributed Systems, IEEE Transactions on
Publisher
ieee
ISSN
1045-9219
Type
jour
DOI
10.1109/TPDS.2011.174
Filename
5887318
Link To Document