DocumentCode :
3264467
Title :
Comparative architectural characterization of SPEC CPU2000 and CPU2006 benchmarks on the intel® Core™ 2 Duo processor
Author :
Kejariwal, Arun ; Veidenbaum, Alexander V. ; Nicolau, Alexandru ; Tian, Xinmin ; Girkar, Milind ; Saito, Hideki ; Banerjee, Utpal
Author_Institution :
Center for Embedded Comput. Syst., Univ. of California, Irvine, CA
fYear :
2008
fDate :
21-24 July 2008
Firstpage :
132
Lastpage :
141
Abstract :
SPEC CPU benchmarks are commonly used by compiler writers and architects of general purpose processors for performance evaluation. Since the release of the CPU89 suite, the SPEC CPU benchmark suites have evolved, with applications either removed or added or upgraded. This influences the design decisions for the next generation compilers and microarchitectures. In view of the above, it is critical to characterize the applications in the new suite - SPEC CPU2006 - to guide the decision making process. Although similar studies using the retired SPEC CPU benchmark suites have been done in the past, to the best of our knowledge, a thorough performance characterization of CPU2006 and its comparison with CPU2000 has not been done so far. In this paper, we present the above. For this, we compiled the applications in CPU2000 and CPU2006 using the Intelreg2 Fortran/C++ optimizing compiler and executed them, using the reference data sets, on the state-of-the-art Intel Coretrade2 Duo processor. The performance information was collected by using the Intel VTunetrade performance analyzer that takes advantage of the built-in hardware performance counters to obtain accurate information on program behavior and its use of processor resources. The focus of this paper is on branch and memory access behavior, the well-known reasons for program performance problems. By analyzing and comparing the L1 data and L2 cache miss rates, branch prediction accuracy, and resource stalls the performance impact in each suite is indirectly determined and described. Not surprisingly, the CPU2006 codes are larger, more complex, and have larger data sets. This leads to higher average L2 cache miss rates and a slight reduction in average IPC compared to the CPU2000 suite. Similarly, the average branch behavior is slightly worse in CPU2006 suite. However, based on processor stall counts branches are much less of a problem. The results presented here are a step towards understanding the SPEC CPU2- - 006 benchmarks and will aid compiler writers in understanding the impact of currently implemented optimizations and in the design of new ones to address the new challenges presented by SPEC CPU2006. Similar opportunities exist for architecture optimization.
Keywords :
cache storage; decision making; program compilers; Intel Core 2 Duo processor; L1 data; L2 cache miss rates; SPEC CPU2000; SPEC CPU2006; architecture optimization; branch prediction accuracy; built-in hardware performance counters; compilers; decision making; memory access behavior; processor resources; program behavior; Accuracy; Application software; Counting circuits; Decision making; Embedded computing; Hardware; Information analysis; Microarchitecture; Optimizing compilers; Performance analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Embedded Computer Systems: Architectures, Modeling, and Simulation, 2008. SAMOS 2008. International Conference on
Conference_Location :
Samos
Print_ISBN :
978-1-4244-1985-2
Type :
conf
DOI :
10.1109/ICSAMOS.2008.4664856
Filename :
4664856
Link To Document :
بازگشت