DocumentCode :
58828
Title :
Automatic Generation of Miniaturized Synthetic Proxies for Target Applications to Efficiently Design Multicore Processors
Author :
Ganesan, Kavita ; John, Lizy Kurian
Author_Institution :
Oracle Inc., Austin, TX, USA
Volume :
63
Issue :
4
fYear :
2014
fDate :
Apr-14
Firstpage :
833
Lastpage :
846
Abstract :
Prohibitive simulation time with pre-silicon design models and unavailability of proprietary target applications make microprocessor design very tedious. The framework proposed in this paper is the first attempt to automatically generate synthetic benchmark proxies for real world multithreaded applications. The framework includes metrics that characterize the behavior of the workloads in the shared caches, coherence logic, out-of-order cores, interconnection network and DRAM. The framework is evaluated by generating proxies for the workloads in the multithreaded PARSEC benchmark suite and validating their fidelity by comparing the microarchitecture dependent and independent metrics to that of the original workloads. The average error in IPC is 4.87 percent and maximum error is 10.8 percent for Raytrace in comparison to the original workloads. The average error in the power-per-cycle metric is 2.73 percent with a maximum of 5.5 percent when compared to original workloads. The representativeness of the proxies to that of the original workloads in terms of their sensitivity to design changes is evaluated by finding the correlation coefficient between the trends followed by the synthetic and the original for design changes in IPC, which is 0.92. A speedup of four to six orders of magnitude is achieved by using the synthetic proxies over the original workloads.
Keywords :
DRAM chips; cache storage; microprocessor chips; multi-threading; multiprocessing systems; multiprocessor interconnection networks; DRAM; Raytrace; coherence logic; interconnection network; microprocessor design; miniaturized synthetic proxies; multicore processors; multithreaded applications; out-of-order cores; presilicon design models; prohibitive simulation; shared caches; synthetic benchmark proxies; Benchmark testing; Cloning; Measurement; Memory management; Multicore processing; Parallel processing; Power demand; Computer architecture; multicore systems; synthetic benchmarks; workload cloning and power modeling;
fLanguage :
English
Journal_Title :
Computers, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9340
Type :
jour
DOI :
10.1109/TC.2013.36
Filename :
6463386
Link To Document :
بازگشت