Understanding the performance of stencil computations on Intel´s Xeon Phi

Author

Peraza, Joshua ; Tiwari, Anish ; Laurenzano, Michael ; Carrington, Laura ; Ward, William A. ; Campbell, Rick

Author_Institution

Univ. of California, San Diego, La Jolla, CA, USA

fYear

2013

fDate

23-27 Sept. 2013

Firstpage

1

Lastpage

5

Abstract

Accelerators are becoming prevalent in high performance computing as a way of achieving increased computational capacity within a smaller power budget. Effectively utilizing the raw compute capacity made available by these systems, however, remains a challenge because it can require a substantial investment of programmer time to port and optimize code to effectively use novel accelerator hardware. In this paper we present a methodology for isolating and modeling the performance of common performance-critical patterns of code (so-called idioms) and other relevant behavioral characteristics from large scale HPC applications which are likely to perform favorably on Intel Xeon Phi. The benefits of the methodology are twofold: (1) it directs programmer efforts toward the regions of code most likely to benefit from porting to the Xeon Phi and (2) provides speedup estimates for porting those regions of code. We then apply the methodology to the stencil idiom, showing performance improvements of up to a factor of 4.7× on stencil-based benchmark codes.

Keywords

parallel processing; performance evaluation; Intel Xeon Phi; code common performance-critical patterns; large scale HPC applications; stencil computations; stencil idiom; stencil-based benchmark codes; Acceleration; Atmospheric measurements; Benchmark testing; Computational modeling; Heating; Particle measurements; Three-dimensional displays;

fLanguage

English

Publisher

ieee

Conference_Titel

Cluster Computing (CLUSTER), 2013 IEEE International Conference on

Conference_Location

Indianapolis, IN

Type

conf

DOI

10.1109/CLUSTER.2013.6702651

Filename

6702651