Title :
Flexible compiler-managed L0 buffers for clustered VLIW processors
Author :
Gibert, Enric ; Sánchez, Jesús ; González, Antonio
Author_Institution :
Dept. of Comput. Archit., Univ. Politecnica de Catalunya, Barcelona, Spain
Abstract :
Wire delays are a major concern for current and forthcoming processors. One approach to attack this problem is to divide the processor into semi-independent units referred to as clusters. A cluster usually consists of a local register file and a subset of the functional units, while the data cache remains centralized. However, as technology evolves, the latency of such a centralized cache increase leading to an important performance impact. In this paper, we propose to include flexible low-latency buffers in each cluster in order to reduce the performance impact of higher cache latencies. The reduced number of entries in each buffer permits the design of flexible ways to map data from L1 to these buffers. The proposed L0 buffers are managed by the compiler, which is responsible to decide which memory instructions make us of them. Effective instruction scheduling techniques are proposed to generate code that exploits these buffers. Results for the Mediabench benchmark suite show that the performance of a clustered VLIW processor with a unified L1 data cache is improved by 16% when such buffers are used. In addition, the proposed architecture also shows significant advantages over both MultiVLIW processors and clustered processors with a word-interleaved cache, two state-of-the-art designs with a distributed L1 data cache.
Keywords :
cache storage; distributed memory systems; instruction sets; parallel architectures; processor scheduling; program compilers; Mediabench benchmark; MultiVLIW processors; cache latencies; clustered VLIW processors; compiler-managed L0 buffers; data caches; instruction scheduling; local register file; memory instructions; very long instruction word; wire delays; Computational modeling; Computer architecture; Delay; Electronic mail; Energy consumption; Filters; Microarchitecture; Processor scheduling; VLIW; Wire;
Conference_Titel :
Microarchitecture, 2003. MICRO-36. Proceedings. 36th Annual IEEE/ACM International Symposium on
Print_ISBN :
0-7695-2043-X
DOI :
10.1109/MICRO.2003.1253205