• DocumentCode
    1556564
  • Title

    A Novel Parallel Scan for Multicore Processors and Its Application in Sparse Matrix-Vector Multiplication

  • Author

    Zhang, Nan

  • Author_Institution
    Dept. of Comput. Sci. & Software Eng., Xi´´an Jiaotong-Liverpool Univ., Suzhou, China
  • Volume
    23
  • Issue
    3
  • fYear
    2012
  • fDate
    3/1/2012 12:00:00 AM
  • Firstpage
    397
  • Lastpage
    404
  • Abstract
    We present a novel parallel algorithm for computing the scan operations on x86 multicore processors. The existing best known parallel scan for the same platform requires the number of processors to be a power of two. But this constraint is removed from our proposed method. In the design of the algorithm architectural considerations for x86 multicore processors are given so that the rate of cache misses is reduced and the cost of thread synchronization and management is minimized. Results from tests made on a machine with dual-socket times quad-core Intel Xeon E5405 showed that the proposed solution outperformed the best known parallel reference. A novel approach to sparse matrix-vector multiplication (SpMV) based on the proposed scan is then explained. The approach, unlike the existing ones that make use of backward segmented operations, uses forward ones for more efficient caching. An implementation of the proposed SpMV was tested against the SpMV in Intel´s Math Kernel Library (MKL) and merits were found in the proposed approach.
  • Keywords
    cache storage; matrix multiplication; multiprocessing systems; parallel algorithms; vectors; Intel Math Kernel Library; cache miss reduction; dual-socket -times quad-core Intel Xeon E5405; management cost minimization; parallel algorithm; parallel scan; sparse matrix-vector multiplication; thread synchronization cost minimization; x86 multicore processors; Algorithm design and analysis; Arrays; Instruction sets; Multicore processing; Software algorithms; Sparse matrices; Parallel algorithms; multicore computing; parallel scan; prefix sum; sparse matrix-vector multiplication.;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/TPDS.2011.174
  • Filename
    5887318