Title :
An Optimized Reduction Design to Minimize Atomic Operations in Shared Memory Multiprocessors
Author :
Speziale, Ettore ; Biagio, Andrea Di ; Agosta, Giovanni
Author_Institution :
Dipt. di Elettron. ed Inf., Politec. di Milano, Milan, Italy
Abstract :
Reduction operations play a key role in modern massively data parallel computation. However, current implementations in shared memory programming APIs such as OpenMP are often computation bottlenecks due to the high number of atomic operations involved. We propose a reduction design that exploits the coupling with a barrier synchronization to optimize the execution of the reduction. Experimental results show how the number of atomic operations involved is dramatically reduced, which can lead to significant improvement in scaling properties on large numbers of processing elements. We report a speedup of 1.53x on the 312.swim_m SPEC OMP2001 benchmark and a speedup of 4.02x on the streamcluster benchmark from the PARSEC suite over the baseline.
Keywords :
application program interfaces; message passing; parallel processing; shared memory systems; 312.swim_m SPEC OMP2001 benchmark; API; OpenMP; PARSEC suite; atomic operation minimisation; barrier synchronization; massively data parallel computation; optimized reduction design; scaling property; shared memory multiprocessor; shared memory programming; streamcluster benchmark; Aggregates; Benchmark testing; Containers; Instruction sets; Parallel processing; Payloads; Synchronization;
Conference_Titel :
Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 IEEE International Symposium on
Conference_Location :
Shanghai
Print_ISBN :
978-1-61284-425-1
Electronic_ISBN :
1530-2075
DOI :
10.1109/IPDPS.2011.271