Title :
Conflict-free accesses to strided vectors on a banked cache
Author :
Seznec, André ; Espasa, Roger
Author_Institution :
IRISA, Inst. Nat. de Recherche en Inf. et Autom., Rennes, France
fDate :
7/1/2005 12:00:00 AM
Abstract :
With the advance of integration technology, it has become feasible to implement a microprocessor, a vector unit, and a multimegabyte bank-interleaved L2 cache on a single die. Parallel access to strided vectors on the L2 cache is a major performance issue on such vector microprocessors. A major difficulty for such a parallel access is that one would like to interleave the cache on a block size basis in order to benefit from spatial locality and to maintain a low tag volume, while strided vector accesses naturally work on a word granularity. In this paper, we address this issue. Considering a parallel vector unit with 2n independent lanes, a 2n bank interleaved cache, and a cache line size of 2k words, we show that any slice of 2n+k consecutive elements of any strided vector with stride 2rR with R odd and r ≤ k can be accessed in the L2 cache and routed back to the lanes in 2k subslices of 2n elements.
Keywords :
cache storage; parallel memories; vector processor systems; bank-interleaved L2 cache; conflict free access; strided vectors; vector microprocessor; Bandwidth; Feeds; Memory management; Microprocessors; Scientific computing; Supercomputers; Index Terms- Vector microprocessor; L2 caches.; conflict free access; strided vectors;
Journal_Title :
Computers, IEEE Transactions on
DOI :
10.1109/TC.2005.110