Title :
The impact of instruction-level parallelism on multiprocessor performance and simulation methodology
Author :
Pai, Vijay S. ; Ranganathan, Parthasarathy ; Adve, Sarita V.
Author_Institution :
Dept. of Electr. & Comput. Eng., Rice Univ., Houston, TX, USA
Abstract :
Current microprocessors exploit high levels of instruction-level parallelism (ILP) through techniques such as multiple issue, dynamic scheduling, and non-blocking reads. This paper presents the first detailed analysis of the impact of such processors on shared-memory multiprocessors using a detailed execution-driven simulator. Using this analysis, we also examine the validity of common direct-execution simulation techniques that employ previous-generation processor models to approximate ILP-based multiprocessors. We find that ILP techniques substantially reduce CPU time in multiprocessors, but are less effective in reducing memory stall time. Consequently, despite the presence of inherent latency-tolerating techniques in ILP processors, memory stall time becomes a larger component of execution time and parallel efficiencies are generally poorer in ILP-based multiprocessors than in previous-generation multiprocessors. Examining the validity of direct-execution simulators with previous-generation processor models, we find that, with appropriate approximations, such simulators can reasonably characterize the behavior of applications with poor overlap of read misses. However, they can be highly inaccurate for applications with high overlap of read misses. For our applications, the errors in execution time with these simulators range from 26% to 192% for the most commonly used model, and from -8% to 73% for the most accurate model
Keywords :
discrete event simulation; instruction sets; processor scheduling; shared memory systems; direct-execution simulation; direct-execution simulators; dynamic scheduling; execution-driven simulator; instruction-level parallelism; multiprocessor performance; nonblocking reads; processor models; shared-memory multiprocessors; simulation methodology; Analytical models; Computational modeling; Computer simulation; Delay; Dynamic scheduling; Microprocessors; Out of order; Parallel processing; Performance analysis; Processor scheduling;
Conference_Titel :
High-Performance Computer Architecture, 1997., Third International Symposium on
Conference_Location :
San Antonio, TX
Print_ISBN :
0-8186-7764-3
DOI :
10.1109/HPCA.1997.569611