DocumentCode :
1997531
Title :
Offload Compiler Runtime for the Intel® Xeon Phi Coprocessor
Author :
Newburn, Chris J. ; Dmitriev, Serguei ; Narayanaswamy, Renukaprasad ; Wiegert, John ; Murty, Rohan ; Chinchilla, Francisco ; Deodhar, R. ; McGuire, Robert
fYear :
2013
fDate :
20-24 May 2013
Firstpage :
1213
Lastpage :
1225
Abstract :
The Intel® Xeon Phi coprocessor platform has a new software stack that enables new programming models. One such model is offload of computation from a host processor to a coprocessor that is a fully-capable Intel® Architecture CPU, namely, the Intel® Xeon Phi coprocessor. The purpose of that offload is to improve response time and/or throughput. This paper presents the compiler offload software runtime infrastructure for the Intel® Xeon Phi coprocessor, which includes a production C/C++ and Fortran compiler that enables offload to that coprocessor, and an underlying Intel® Many Integrated Core (Intel® MIC) platform software stack that enables offloading. The paper shares the insights that grow out of the experience of a multi-year, intensive development effort. It addresses end users´ questions about offload with the compiler offload runtime, namely, why offload to a co-processor is useful, how it is specified, and what the conditions for the profitability of offload are. It also serves as a guide to potential third-party developers of offload runtimes, such as a gcc-based offload compiler, ports of existing commercial offloading compilers to Intel® Xeon Phi coprocessor such as CAPS®, and third-party offload library vendors that Intel is working with, such as NAG® and MAGMA®. It describes the software architecture and design of the offload compiler runtime. It enumerates the key performance features for this heterogeneous computing stack, related to initializa-tion, data movement and invocation. Finally, it evaluates the performance impact of those features for a set of directed micro-benchmarks and larger workloads.
Keywords :
coprocessors; program compilers; Fortran compiler; Intel Xeon Phi coprocessor platform; Intel architecture CPU; compiler offload software runtime infrastructure; data movement; directed microbenchmarks; heterogeneous computing stack; integrated core platform software stack; offload compiler runtime; profitability; programming models; software architecture; third party developers; third party offload library vendors; Computational modeling; Coprocessors; Kernel; Libraries; Programming; Runtime; MIC; Xeon Phi; compiler; coprocessor; heterogeneous; offload; runtime;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
Conference_Location :
Cambridge, MA
Print_ISBN :
978-0-7695-4979-8
Type :
conf
DOI :
10.1109/IPDPSW.2013.251
Filename :
6651008
Link To Document :
بازگشت