DocumentCode
3145351
Title
An Optimized Reduction Design to Minimize Atomic Operations in Shared Memory Multiprocessors
Author
Speziale, Ettore ; Biagio, Andrea Di ; Agosta, Giovanni
Author_Institution
Dipt. di Elettron. ed Inf., Politec. di Milano, Milan, Italy
fYear
2011
fDate
16-20 May 2011
Firstpage
1300
Lastpage
1309
Abstract
Reduction operations play a key role in modern massively data parallel computation. However, current implementations in shared memory programming APIs such as OpenMP are often computation bottlenecks due to the high number of atomic operations involved. We propose a reduction design that exploits the coupling with a barrier synchronization to optimize the execution of the reduction. Experimental results show how the number of atomic operations involved is dramatically reduced, which can lead to significant improvement in scaling properties on large numbers of processing elements. We report a speedup of 1.53x on the 312.swim_m SPEC OMP2001 benchmark and a speedup of 4.02x on the streamcluster benchmark from the PARSEC suite over the baseline.
Keywords
application program interfaces; message passing; parallel processing; shared memory systems; 312.swim_m SPEC OMP2001 benchmark; API; OpenMP; PARSEC suite; atomic operation minimisation; barrier synchronization; massively data parallel computation; optimized reduction design; scaling property; shared memory multiprocessor; shared memory programming; streamcluster benchmark; Aggregates; Benchmark testing; Containers; Instruction sets; Parallel processing; Payloads; Synchronization;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 IEEE International Symposium on
Conference_Location
Shanghai
ISSN
1530-2075
Print_ISBN
978-1-61284-425-1
Electronic_ISBN
1530-2075
Type
conf
DOI
10.1109/IPDPS.2011.271
Filename
6008982
Link To Document