DocumentCode :
3621006
Title :
Efficient Synchronization for Nonuniform Communication Architectures
Author :
Z. Radovic;E. Hagersten
Author_Institution :
Uppsala University
fYear :
2002
fDate :
6/24/1905 12:00:00 AM
Firstpage :
13
Lastpage :
13
Abstract :
Scalable parallel computers are often nonuniform communication architectures (NUCAs), where the access time to other processor’s caches vary with their physical location. Still, few attempts of exploring cache-to-cache communication locality have been made. This paper introduces a new kind of synchronization primitives (lock-unlock) that favor neighboring processors when a lock is released. This improves the lock handover time as well as access time to the shared data of the critical region. A critical section guarded by our new RH lock takes less than half the time to execute compared with the same critical section guarded by any other lock on our NUCA hardware. The execution time for Raytrace with 28 processors was improved 2.23 - 4.68 times, while global traffic was dramatically decreased compared with all the other locks. The average execution time was improved 7 - 24% while the global traffic was decreased 8 - 28% for an average over the seven applications studied.
Keywords :
"Traffic control","Computer architecture","Yarn","Testing","Scalability","Performance evaluation","Information technology","Concurrent computing","Physics computing","Hardware"
Publisher :
ieee
Conference_Titel :
Supercomputing, ACM/IEEE 2002 Conference
ISSN :
1063-9535
Print_ISBN :
0-7695-1524-X
Type :
conf
DOI :
10.1109/SC.2002.10038
Filename :
1592849
Link To Document :
بازگشت