Title :
Data forwarding in scalable shared-memory multiprocessors
Author :
Koufaty, David A. ; Chen, Xiangfeng ; Poulsen, David K. ; Torrellas, Josep
Author_Institution :
Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
fDate :
12/1/1996 12:00:00 AM
Abstract :
Scalable shared-memory multiprocessors are often slowed down by long-latency memory accesses. One way to cope with this problem is to use data forwarding to overlap memory accesses with computation. With data forwarding, when a processor produces a datum, in addition to updating its cache, it sends a copy of the datum to the caches of the processors that the compiler identified as consumers of it. As a result, when the consumer processors access the datum, they find it in their caches. This paper addresses two main issues. First, it presents a framework for a compiler algorithm for forwarding. Second, using address traces, it evaluates the performance impact of different levels of support for forwarding. Our simulations of a 32-processor machine show that an optimistic support for forwarding speeds up five applications by an average of 50% for large caches and 30% for small caches. For large caches, most sharing read misses are eliminated, while for small caches, forwarding does not increase the number of conflict misses significantly. Overall, support for forwarding in shared-memory multiprocessors promises to deliver good application speedups
Keywords :
cache storage; digital simulation; fault tolerant computing; shared memory systems; address traces; compiler algorithm; consumer processors; data forwarding; long-latency memory accesses; performance impact; scalable shared-memory multiprocessors; Delay; Graphics; Hardware; Multithreading; Optimizing compilers; Prefetching; Protocols; Research and development; Runtime; Silicon;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on