DocumentCode :
2054106
Title :
The memory of bandwidth bottleneck and its amelioration by a compiler
Author :
Ding, Chen ; Kennedy, Ken
Author_Institution :
Rice Univ., Houston, TX, USA
fYear :
2000
fDate :
2000
Firstpage :
181
Lastpage :
189
Abstract :
As the speed gap between CPU and memory widens, memory hierarchy has become the primary factor limiting program performance. Until now, the principal focus of hardware and software innovations has been overcoming latency. However, the advent of latency tolerance techniques such as non-blocking cache and software prefetching begins the process of trading bandwidth for latency by overlapping and pipelining memory transfers. Since actual latency is the inverse of the consumed bandwidth, memory latency cannot be fully tolerated without infinite bandwidth. This perspective has led us to two questions. Do current machines provide sufficient data bandwidth? If not, can a program be restructured to consume less bandwidth? This paper answers these questions in two parts. The first part defines a new bandwidth-based performance model and demonstrates the serious performance bottleneck due to the lack of memory bandwidth. The second part describes a new set of compiler optimizations for reducing bandwidth consumption of programs
Keywords :
optimising compilers; software performance evaluation; bandwidth bottleneck memory; bandwidth-based performance model; compiler optimizations; latency; memory hierarchy; performance bottleneck; program performance; software prefetching; Bandwidth; Delay; Electronic switching systems; Lapping; Optimizing compilers; Pipeline processing; Program processors; Read-write memory; Technological innovation; Tellurium;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium, 2000. IPDPS 2000. Proceedings. 14th International
Conference_Location :
Cancun
Print_ISBN :
0-7695-0574-0
Type :
conf
DOI :
10.1109/IPDPS.2000.845980
Filename :
845980
Link To Document :
بازگشت