Title :
Distributed hardwired barrier synchronization for scalable multiprocessor clusters
Author :
Shang, Shisheng ; Hwang, Kai
Author_Institution :
Dept. of Comput. Eng., Kaohsiung Polytech. Inst., Taiwan
fDate :
6/1/1995 12:00:00 AM
Abstract :
Conventional multiprocessors mostly use centralized, memory-based barriers to synchronize concurrent processes created in multiple processors. These centralized barriers often become the bottleneck or hot spots in the shared memory. In this paper, we overcome the difficulty by presenting a distributed and hardwired barrier architecture, that is hierarchically constructed for fast synchronization in cluster-structured multiprocessors. The hierarchical architecture enables the scalability of cluster-structured multiprocessors. A special set of synchronization primitives is developed for explicit use of distributed barriers dynamically. To show the application of the hardwired barriers, we demonstrate how to synchronize Doall and Doacross loops using a limited number of hardwired barriers. Timing analysis shows an O(102) to O(105) reduction in synchronization overhead, compared with the use of software-controlled barriers implemented in a shared memory. The hardwired architecture is effective in implementing any partially ordered set of barriers or fuzzy barriers with extended synchronization regions. The versatility, scalability, programmability, and low overhead make the distributed barrier architecture attractive in constructing fine-grain, massively parallel MIMD systems using multiprocessor clusters with distributed shared memory
Keywords :
distributed memory systems; shared memory systems; synchronisation; concurrent processes synchronisation; distributed hardwired barrier synchronization; distributed shared memory; fuzzy barriers; hardwired barrier architecture; hot spots; massively parallel MIMD systems; memory-based barriers; programmability; scalability; scalable multiprocessor clusters; timing analysis; Application software; Buildings; Computer architecture; Fuzzy logic; Hardware; Integrated circuit interconnections; Memory architecture; Scalability; Timing; Wires;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on