DocumentCode
598583
Title
Tiling stencil computations to maximize parallelism
Author
Bandishti, V. ; Pananilath, I. ; Bondhugula, Uday
Author_Institution
Dept. of Comput. Sci. & Autom., Indian Inst. of Sci., Bangalore, India
fYear
2012
fDate
10-16 Nov. 2012
Firstpage
1
Lastpage
11
Abstract
Most stencil computations allow tile-wise concurrent start, i.e., there always exists a face of the iteration space and a set of tiling hyperplanes such that all tiles along that face can be started concurrently. This provides load balance and maximizes parallelism. However, existing automatic tiling frameworks often choose hyperplanes that lead to pipelined start-up and load imbalance. We address this issue with a new tiling technique that ensures concurrent start-up as well as perfect load-balance whenever possible. We first provide necessary and sufficient conditions on tiling hyperplanes to enable concurrent start for programs with affine data accesses. We then provide an approach to find such hyperplanes. Experimental evaluation on a 12-core Intel Westmere shows that our code is able to outperform a tuned domain-specific stencil code generator by 4% to 27%, and previous compiler techniques by a factor of 2× to 10.14×.
Keywords
concurrency control; pipeline processing; program compilers; resource allocation; Intel Westmere; affine data accesses; automatic tiling frameworks; load balance; load imbalance; necessary and sufficient conditions; pipelined start-up; tile-wise concurrent start-up; tiling hyperplanes; tiling stencil computations; tuned domain-specific stencil code generator; Equations; Face; Parallel processing; Schedules; Silicon; Tiles; Vectors; Compilers; Program transformation;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis (SC), 2012 International Conference for
Conference_Location
Salt Lake City, UT
ISSN
2167-4329
Print_ISBN
978-1-4673-0805-2
Type
conf
DOI
10.1109/SC.2012.107
Filename
6468470
Link To Document