Title :
Understanding the performance of stencil computations on Intel´s Xeon Phi
Author :
Peraza, Joshua ; Tiwari, Anish ; Laurenzano, Michael ; Carrington, Laura ; Ward, William A. ; Campbell, Rick
Author_Institution :
Univ. of California, San Diego, La Jolla, CA, USA
Abstract :
Accelerators are becoming prevalent in high performance computing as a way of achieving increased computational capacity within a smaller power budget. Effectively utilizing the raw compute capacity made available by these systems, however, remains a challenge because it can require a substantial investment of programmer time to port and optimize code to effectively use novel accelerator hardware. In this paper we present a methodology for isolating and modeling the performance of common performance-critical patterns of code (so-called idioms) and other relevant behavioral characteristics from large scale HPC applications which are likely to perform favorably on Intel Xeon Phi. The benefits of the methodology are twofold: (1) it directs programmer efforts toward the regions of code most likely to benefit from porting to the Xeon Phi and (2) provides speedup estimates for porting those regions of code. We then apply the methodology to the stencil idiom, showing performance improvements of up to a factor of 4.7× on stencil-based benchmark codes.
Keywords :
parallel processing; performance evaluation; Intel Xeon Phi; code common performance-critical patterns; large scale HPC applications; stencil computations; stencil idiom; stencil-based benchmark codes; Acceleration; Atmospheric measurements; Benchmark testing; Computational modeling; Heating; Particle measurements; Three-dimensional displays;
Conference_Titel :
Cluster Computing (CLUSTER), 2013 IEEE International Conference on
Conference_Location :
Indianapolis, IN
DOI :
10.1109/CLUSTER.2013.6702651