DocumentCode :
3124705
Title :
A comparison of parallelization techniques for irregular reductions
fYear :
2001
fDate :
36982
Abstract :
A large class of scientific applications are comprised of irregular reductions on large data sets. On shared-memory multiprocessors these reductions are typically parallelized by computing partial results into replicated buffers, then combining the values into shared data using synchronization. Recently, a number of alternative techniques have been developed based on selective privatization, local writes, and synchronized writes. In this paper, we present a more efficient version of the local write algorithm which is 56% faster on average. We then experimentally compare the performance of each technique using a number of representative kernels. Results show speedups vary greatly depending on application characteristics such as connectivity, locality, and adaptivity. In general, we find the local write technique provides the best performance, particularly when applications display good locality
Keywords :
parallel programming; parallelising compilers; irregular reductions; local write technique; parallelization techniques; shared-memory multiprocessors; Application software; Computational fluid dynamics; Computer science; Concurrent computing; Data structures; Displays; Educational institutions; Kernel; Partial differential equations; Privatization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium., Proceedings 15th International
Conference_Location :
San Francisco, CA
ISSN :
1530-2075
Print_ISBN :
0-7695-0990-8
Type :
conf
DOI :
10.1109/IPDPS.2001.924963
Filename :
924963
Link To Document :
بازگشت