Title :
Analytic evaluation of shared-memory architectures
Author :
Sorin, Daniel J. ; Lemon, Jonathan L. ; Eager, Derek L. ; Vernon, Mary K.
Author_Institution :
Dept. of Electr. & Comput. Eng., Duke Univ., Durham, NC, USA
fDate :
2/1/2003 12:00:00 AM
Abstract :
This paper develops and validates an efficient analytical model for evaluating the performance of shared memory architectures with ILP processors. First, we instrument the SimOS simulator to measure the parameters for such a model and we find a surprisingly high degree of processor memory request heterogeneity in the workloads. Examining the model parameters provides insight into application behaviors and how they interact with the system. Second, we create a model that captures such heterogeneous processor behavior, which is important for analyzing memory system design tradeoffs. Highly bursty memory request traffic and lock contention are also modeled in a significantly more robust way than in previous work. With these features, the model is applicable to a wide range of architectures and applications. Although the features increase the model complexity, it is a useful design tool because the size of the model input parameter set remains manageable, and the model is still several orders of magnitude quicker to solve than detailed simulation. Validation results show that the model is highly accurate, producing heterogeneous per processor throughputs that are generally within 5 percent and, for the workloads validated, always within 13 percent of the values measured by detailed simulation with SimOS. Several examples illustrate applications of the model to studying architectural design issues and the interactions between the architecture and the application workloads.
Keywords :
computational complexity; digital simulation; performance evaluation; shared memory systems; ILP processors; SimOS simulator; analytic evaluation; application behaviors; model complexity; performance evaluation; processor memory request; shared-memory architectures; Analytical models; Computational modeling; Computer Society; Computer architecture; Instruments; Memory architecture; Performance analysis; Robustness; Throughput; Traffic control;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
DOI :
10.1109/TPDS.2003.1178880