DocumentCode :
1242101
Title :
Evaluating performance tradeoffs between fine-grained and coarse-grained alternatives
Author :
Dubey, Pradeep IS ; Adams, George B., III ; Flynn, M.J.
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
6
Issue :
1
fYear :
1995
fDate :
1/1/1995 12:00:00 AM
Firstpage :
17
Lastpage :
27
Abstract :
Recent simulation based studies suggest that while superpipelines and superscalars are equally capable of exploiting fine grained concurrency, multiprocessors are better at exploiting coarse grained parallelism. An analytical model that is more flexible and less costly in terms of run time than simulation, is proposed as a tool for analyzing the tradeoff between superpipelined processors, superscalar processors, and multiprocessors. The duality of superpipelines and superscalars is examined in detail. The performance limit for these systems has been derived and it supports the fetch bottleneck observation of previous researchers. Common characteristics of utilization curves for such systems are examined. Combined systems, such as superpipelined multiprocessors and superscalar multiprocessors, are also analyzed. The model shows that the number of pipelines (or processors) at which the maximum throughput is obtained is, as memory access time increases, increasingly sensitive to the ratio of memory access time to network access delay. Further, as a function of interiteration dependence distance, optimum throughput is shown to vary nonlinearly, whereas the corresponding optimum number of processors varies linearly. The predictions from the analytical model agree with similar results published using simulation based techniques
Keywords :
multiprocessing systems; parallel architectures; performance evaluation; pipeline processing; coarse-grained alternatives; duality; fetch bottleneck observation; interiteration dependence distance; memory access time; multiprocessors; network access delay; performance limit; performance tradeoffs; simulation based studies; superpipelines; superscalar processors; utilization curves; Analytical models; Assembly; Clocks; Concurrent computing; Delay effects; Microprocessors; NASA; Pipelines; Predictive models; Throughput;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/71.363414
Filename :
363414
Link To Document :
بازگشت