DocumentCode :
572119
Title :
TLSync: Support for multiple fast barriers using on-chip transmission lines
Author :
Oh, Jungju ; Prvulovic, Milos ; Zajic, Alenka
Author_Institution :
Georgia Inst. of Technol., Atlanta, GA, USA
fYear :
2011
fDate :
4-8 June 2011
Firstpage :
105
Lastpage :
115
Abstract :
As the number of cores on a single-chip grows, scalable barrier synchronization becomes increasingly difficult to implement. In software implementations, such as the tournament barrier, a larger number of cores results in a longer latency for each round and a larger number of rounds. Hardware barrier implementations require significant dedicated wiring, e.g., using a reduction (arrival) tree and a notification (release) tree, and multiple instances of this wiring are needed to support multiple barriers (e.g., when concurrently executing multiple parallel applications). This paper presents TLSync, a novel hardware barrier implementation that uses the high-frequency part of the spectrum in a transmission-line broadcast network, thus leaving the transmission line network free for non-modulated (base-band) data transmission. In contrast to other implementations of hardware barriers, TLSync allows multiple thread groups to each have its own barrier. This is accomplished by allocating different bands in the radio-frequency spectrum to different groups. Our circuit-level and electromagnetic models show that the worst-case latency for a TLSync barrier is 4ns to 10ns, depending on the size of the frequency band allocated to each group, and our cycle-accurate architectural simulations show that low-latency TLSync barriers provide significant performance and scalability benefits to barrier-intensive applications.
Keywords :
circuit simulation; data communication; multi-threading; multiprocessor interconnection networks; network-on-chip; performance evaluation; radio networks; radiofrequency integrated circuits; synchronisation; transmission lines; trees (mathematics); wiring; barrier-intensive applications; circuit-level model; cycle-accurate architectural simulation; electromagnetic model; hardware barrier implementations; low-latency TLSync barriers; multiple fast barriers; multiple wiring instances; multithreading; nonmodulated data transmission; notification tree; on-chip transmission lines; radiofrequency spectrum; reduction tree; scalable barrier synchronization; transmission line broadcast network; worst-case latency; Delay; Frequency modulation; Power transmission lines; Receivers; Synchronization; Transmitters; Wires;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Architecture (ISCA), 2011 38th Annual International Symposium on
Conference_Location :
San Jose, CA
ISSN :
1063-6897
Print_ISBN :
978-1-4503-0472-6
Type :
conf
Filename :
6306757
Link To Document :
بازگشت