DocumentCode :
167442
Title :
Dymaxion++: A Directive-Based API to Optimize Data Layout and Memory Mapping for Heterogeneous Systems
Author :
Shuai Che ; Jiayuan Meng ; Skadron, Kevin
fYear :
2014
fDate :
19-23 May 2014
Firstpage :
916
Lastpage :
924
Abstract :
There has been a growing trend in using heterogeneous systems with CPUs and GPUs to solve diverse compute problems. However, high application performance on these platforms relies on efficient memory accesses. For many applications, CPUs and GPUs prefer different memory mappings and data structure layouts. This in turn requires developers to use device-specific strategies for memory access optimizations. Achieving both code and performance portability becomes a challenge for heterogeneous computing. This paper proposes a directive-based API, Dymaxion++, which enables programmers to optimize memory access patterns across devices with a simple interface. Use of Dymaxion++ requires only minimal modifications to existing codes with a small set of pragma extensions. The current framework augments the original Dymaxion framework with a clean abstraction backed by a source-to-source code translator. Dymaxion++ also provides additional programming features to map data structures to GPU´s hybrid memory spaces (e.g. texture and constant memory) for different uses. Additionally, data layout transformation is enabled while exchanging data between GPU scratchpad and device memory as well as between system memory and device memory.
Keywords :
application program interfaces; data structures; distributed processing; electronic data interchange; graphics processing units; CPU; Dymaxion++; GPU hybrid memory spaces; GPU scratchpad; clean abstraction; data exchange; data layout optimization; data structure layouts; data structures; device memory; device-specific strategies; directive-based API; heterogeneous computing; heterogeneous systems; memory access optimizations; memory access patterns; memory accesses; memory mappings; source-to-source code translator; system memory; Arrays; Graphics processing units; Instruction sets; Kernel; Layout; Programming;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel & Distributed Processing Symposium Workshops (IPDPSW), 2014 IEEE International
Conference_Location :
Phoenix, AZ
Print_ISBN :
978-1-4799-4117-9
Type :
conf
DOI :
10.1109/IPDPSW.2014.104
Filename :
6969480
Link To Document :
بازگشت