Title :
Instruction-level characterization of scientific computing applications using hardware performance counters
Author :
Luo, Yong ; Cameron, Kirk W.
Author_Institution :
Sci. Comput. Group, Los Alamos Nat. Lab., NM, USA
Abstract :
The paper provides characterization methods based on empirical performance counter measurements. In particular, we provide an instruction-level characterization derived empirically in an effort to demonstrate how architectural limitations in underlying hardware will affect the performance of existing codes. Preliminary results provide promise in code characterization, and empirical/analytical modeling. These include the ability to quantify outstanding miss utilization and stall time attributable to architectural limitations in the CPU and the memory hierarchy. This work further promises insight into quantifying bounds for CPI0 or the ideal CPI with infinite, perfect L1 cache. In general, if we can characterize workloads using parameters that are independent of architecture, such as this work, then we can more appropriately compare different architectures in an effort to direct processor/code development
Keywords :
cache storage; computer architecture; instruction sets; natural sciences computing; performance evaluation; CPI0; architectural limitations; characterization methods; code characterization; empirical performance counter measurements; empirical/analytical modeling; hardware performance counters; ideal CPI; instruction-level characterization; memory hierarchy; outstanding miss utilization; perfect L1 cache; processor/code development; scientific computing applications; stall time; Counting circuits; Hardware; Kirk field collapse effect; Laboratories; Microprocessors; Pipelines; Postal services; Power system modeling; Process design; Scientific computing;
Conference_Titel :
Workload Characterization: Methodology and Case Studies, 1999
Conference_Location :
Dallas, TX
Print_ISBN :
0-7695-0450-7
DOI :
10.1109/WWC.1998.809368