DocumentCode :
2798112
Title :
Making Sequential Consistency Practical in Titanium
Author :
Kamil, Amir ; Su, Jimmy ; Yelick, Katherine
Author_Institution :
University of California, Berkeley
fYear :
2005
fDate :
12-18 Nov. 2005
Firstpage :
15
Lastpage :
15
Abstract :
The memory consistency model in shared memory parallel programming controls the order in which memory operations performed by one thread may be observed by another. The most natural model for programmers is to have memory accesses appear to take effect in the order specified in the original program. Language designers have been reluctant to use this strong semantics, called sequential consistency, due to concerns over the performance of memory fence instructions and related mechanisms that guarantee order. In this paper, we provide evidence for the practicality of sequential consistency by showing that advanced compiler analysis techniques are sufficient to eliminate the need for most memory fences and enable high-level optimizations. Our analyses eliminated over 97% of the memory fences that were needed by a naïve implementation, accounting for 87 to 100% of the dynamically encountered fences in all but one benchmark. The impact of the memory model and analysis on runtime performance depends on the quality of the optimizations: more aggressive optimizations are likely to be invalidated by a strong memory consistency semantics. We consider two specific optimizations (pipelining of bulk memory copies, and communication aggregation and scheduling for irregular accesses) and show that our most aggressive analysis is able to obtain the same performance as the relaxed model when applied to two linear algebra kernels. While additional work on parallel optimizations and analyses is needed, we believe these results provide important evidence on the viability of using a simple memory consistency model without sacrificing performance.
Keywords :
Kernel; Linear algebra; Optimizing compilers; Parallel programming; Performance analysis; Pipeline processing; Programming profession; Runtime; Titanium; Yarn;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Supercomputing, 2005. Proceedings of the ACM/IEEE SC 2005 Conference
Print_ISBN :
1-59593-061-2
Type :
conf
DOI :
10.1109/SC.2005.43
Filename :
1559967