DocumentCode
704734
Title
Micro-architecture independent analytical processor performance and power modeling
Author
Van den Steen, Sam ; De Pestel, Sander ; Mechri, Moncef ; Eyerman, Stijn ; Carlson, Trevor ; Black-Schaffer, David ; Hagersten, Erik ; Eeckhout, Lieven
Author_Institution
Dept. of Electron. & Inf. Syst., Ghent Univ., Ghent, Belgium
fYear
2015
fDate
29-31 March 2015
Firstpage
32
Lastpage
41
Abstract
Optimizing processors for specific application(s) can substantially improve energy-efficiency. With the end of Dennard scaling, and the corresponding reduction in energyefficiency gains from technology scaling, such approaches may become increasingly important. However, designing applicationspecific processors require fast design space exploration tools to optimize for the targeted application(s). Analytical models can be a good fit for such design space exploration as they provide fast performance estimations and insight into the interaction between an application´s characteristics and the micro-architecture of a processor. Unfortunately, current analytical models require some microarchitecture dependent inputs, such as cache miss rates, branch miss rates and memory-level parallelism. This requires profiling the applications for each cache and branch predictor configuration, which is far more time-consuming than evaluating the actual performance models. In this work we present a micro-architecture independent profiler and associated analytical models that allow us to produce performance and power estimates across a large design space almost instantaneously. We show that using a micro-architecture independent profile leads to a speedup of 25× for our evaluated design space, compared to an analytical model that uses micro-architecture dependent profiles. Over a large design space, the model has a 13% error for performance and a 7% error for power, compared to cycle-level simulation. The model is able to accurately determine the optimal processor configuration for different applications under power or performance constraints, and it can provide insight into performance through cycle stacks.
Keywords
cache storage; computer architecture; microprocessor chips; power aware computing; branch miss rates; branch predictor configuration; cache miss rates; cache predictor configuration; cycle stacks; cycle-level simulation; energy-efficiency gains; fast design space exploration tools; memory-level parallelism; microarchitecture dependent inputs; microarchitecture independent analytical processor performance; microarchitecture independent profiler; performance constraints; power estimates; power modeling; processor microarchitecture; processor optimization; technology scaling; Analytical models; Computational modeling; Entropy; Estimation; Mathematical model; Parallel processing; Predictive models;
fLanguage
English
Publisher
ieee
Conference_Titel
Performance Analysis of Systems and Software (ISPASS), 2015 IEEE International Symposium on
Conference_Location
Philadelphia, PA
Type
conf
DOI
10.1109/ISPASS.2015.7095782
Filename
7095782
Link To Document