Title :
Optimal skewed tiling for cache locality enhancement
Author_Institution :
Dept. of Comput. Sci., Purdue Univ., West Lafayette, IN, USA
Abstract :
Iterative stencil loops are used in scientific programs to implement relaxation methods for numerical simulation and signal processing. Such loops iteratively modify the same array elements over different time steps. Hence, exploitation of temporal data locality can lead to significantly improved cache performance. This paper shows that, to optimally tile iterative stencil loops, the imperfectly nested inner loops must be realigned such that they can be minimally skewed across different time steps. A memory-reference cost analysis proves that the number of cache misses is minimized when the skewing is minimum. A graph-theoretical algorithm, which takes polynomial time, is presented to determine the minimum skew factors for a given nesting of iterative stencil loops.
Keywords :
cache storage; computational complexity; data structures; graph theory; natural sciences computing; parallel programming; program control structures; relaxation theory; signal processing; software performance evaluation; array elements; cache locality enhancement; cache misses; cache performance; graph-theoretical algorithm; imperfectly nested inner loops; iterative stencil loops; memory-reference cost analysis; numerical simulation; optimal skewed tiling; polynomial time; relaxation methods; scientific programs; signal processing; temporal data locality; Array signal processing; Costs; Iterative algorithms; Iterative methods; Numerical simulation; Polynomials; Relaxation methods; Signal processing algorithms; Testing; Tiles;
Conference_Titel :
Parallel and Distributed Processing Symposium, 2003. Proceedings. International
Print_ISBN :
0-7695-1926-1
DOI :
10.1109/IPDPS.2003.1213120