DocumentCode
668148
Title
Understanding the performance of stencil computations on Intel´s Xeon Phi
Author
Peraza, Joshua ; Tiwari, Anish ; Laurenzano, Michael ; Carrington, Laura ; Ward, William A. ; Campbell, Rick
Author_Institution
Univ. of California, San Diego, La Jolla, CA, USA
fYear
2013
fDate
23-27 Sept. 2013
Firstpage
1
Lastpage
5
Abstract
Accelerators are becoming prevalent in high performance computing as a way of achieving increased computational capacity within a smaller power budget. Effectively utilizing the raw compute capacity made available by these systems, however, remains a challenge because it can require a substantial investment of programmer time to port and optimize code to effectively use novel accelerator hardware. In this paper we present a methodology for isolating and modeling the performance of common performance-critical patterns of code (so-called idioms) and other relevant behavioral characteristics from large scale HPC applications which are likely to perform favorably on Intel Xeon Phi. The benefits of the methodology are twofold: (1) it directs programmer efforts toward the regions of code most likely to benefit from porting to the Xeon Phi and (2) provides speedup estimates for porting those regions of code. We then apply the methodology to the stencil idiom, showing performance improvements of up to a factor of 4.7× on stencil-based benchmark codes.
Keywords
parallel processing; performance evaluation; Intel Xeon Phi; code common performance-critical patterns; large scale HPC applications; stencil computations; stencil idiom; stencil-based benchmark codes; Acceleration; Atmospheric measurements; Benchmark testing; Computational modeling; Heating; Particle measurements; Three-dimensional displays;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing (CLUSTER), 2013 IEEE International Conference on
Conference_Location
Indianapolis, IN
Type
conf
DOI
10.1109/CLUSTER.2013.6702651
Filename
6702651
Link To Document