DocumentCode :
186390
Title :
Extending the roofline model: Bottleneck analysis with microarchitectural constraints
Author :
Cabezas, Victoria Caparros ; Puschel, Markus
Author_Institution :
Dept. of Comput. Sci., ETH Zurich, Zurich, Switzerland
fYear :
2014
fDate :
26-28 Oct. 2014
Firstpage :
222
Lastpage :
231
Abstract :
Software, even if carefully optimized, rarely reaches the peak performance of a processor. Understanding which hardware resource is the bottleneck is difficult but important, as it can help with either further optimizing the code or deciding which hardware component to upgrade for higher performance. If the bottleneck is the memory bandwidth, the roofline model provides a simple but instructive analysis and visualization. In this paper, we take the roofline analysis further by including additional performance-relevant hardware features such as latency, throughput, and capacity information for a multilevel cache hierarchy and out-of-order execution buffers. Two key ideas underlie our analysis. First, we estimate performance based on a scheduling of the computation DAG on a high-level model of a microarchitecture and extract data including utilization of resources and overlaps from a cycle-by-cycle analysis of the schedule. Second, we show how to use this data to create only one plot with multiple rooflines that visualize performance bottlenecks. We validate our model against performance data obtained from a real system, and then apply our bottleneck analysis to a number of floating-point kernels to identify and interpret bottlenecks.
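For readers unfamiliar with the baseline that the abstract builds on, the following is a minimal sketch of the classic roofline bound only: attainable performance is the smaller of the compute peak and memory bandwidth times operational intensity. It does not reproduce the paper's extended model (DAG scheduling, latency, cache capacity, out-of-order buffers). The function names and the machine numbers in the usage note are hypothetical and chosen for illustration.

```python
def roofline_bound(intensity, peak_flops, bandwidth):
    """Upper bound on performance under the basic roofline model.

    intensity  -- operational intensity in flops per byte moved
    peak_flops -- peak compute throughput in flops per second
    bandwidth  -- peak memory bandwidth in bytes per second
    """
    return min(peak_flops, bandwidth * intensity)


def bottleneck(intensity, peak_flops, bandwidth):
    """Name the limiting resource for a kernel of the given intensity."""
    # The ridge point is the intensity at which the two roofs meet.
    ridge = peak_flops / bandwidth
    return "memory bandwidth" if intensity < ridge else "compute"
```

As a usage example with assumed values, a machine with a peak of 100 flops per time unit and a bandwidth of 20 bytes per time unit has its ridge point at 5 flops/byte, so `roofline_bound(0.5, 100.0, 20.0)` gives 10.0 and `bottleneck(0.5, 100.0, 20.0)` reports memory bandwidth as the limit.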
Keywords :
cache storage; directed graphs; multiprocessing systems; storage management; DAG; bottleneck analysis; floating-point kernel; memory bandwidth; microarchitectural constraint; multilevel cache hierarchy; out-of-order execution buffer; performance-relevant hardware feature; roofline model; Analytical models; Bandwidth; Computational modeling; Data models; Hardware; Microarchitecture; Throughput;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Workload Characterization (IISWC), 2014 IEEE International Symposium on
Conference_Location :
Raleigh, NC
Print_ISBN :
978-1-4799-6452-9
Type :
conf
DOI :
10.1109/IISWC.2014.6983061
Filename :
6983061