مرکز منطقه ای اطلاع رساني علوم و فناوري - MPI-ACC: An Integrated and Extensible Approach to Data Movement in Accelerator-based Systems

DocumentCode :

1827894

Title :

MPI-ACC: An Integrated and Extensible Approach to Data Movement in Accelerator-based Systems

Author :

Aji, Ashwin M. ; Dinan, James ; Buntinas, Darius ; Balaji, Pavan ; Feng, Wu-chun ; Bisset, Keith R. ; Thakur, Rajeev

Author_Institution :

Dept. of Comput. Sci., Virginia Tech., Blacksburg, VA, USA

fYear :

2012

fDate :

25-27 June 2012

Firstpage :

647

Lastpage :

654

Abstract :

Data movement in high-performance computing systems accelerated by graphics processing units (GPUs) remains a challenging problem. Data communication in popular parallel programming models, such as the Message Passing Interface (MPI), is currently limited to the data stored in the CPU memory space. Auxiliary memory systems, such as GPU memory, are not integrated into such data movement frameworks, thus providing applications with no direct mechanism to perform end-to-end data movement. We introduce MPI-ACC, an integrated and extensible framework that allows end-to-end data movement in accelerator-based systems. MPI-ACC provides productivity and performance benefits by integrating support for auxiliary memory spaces into MPI. MPI-ACC´s runtime system enables several key optimizations, including pipelining of data transfers and balancing of communication based on accelerator and node architecture. We demonstrate the extensible design of MPIACC by using the popular CUDA and OpenCL accelerator programming interfaces. We examine the impact of MPI-ACC on communication performance and evaluate application-level benefits on a large-scale epidemiology simulation.

Keywords :

application program interfaces; data communication; graphics processing units; optimisation; parallel architectures; parallel programming; pipeline processing; storage management; CPU memory space; CUDA; GPU; MPI-ACC runtime system; OpenCL accelerator programming interfaces; accelerator-based communication; accelerator-based systems; auxiliary memory systems; data communication; data movement; data storage; data transfers pipelining; end-to-end data movement; graphics processing units; high-performance computing systems; large-scale epidemiology simulation; node architecture; parallel programming models; Bandwidth; Computational modeling; Graphics processing unit; Performance evaluation; Pipeline processing; Runtime; CUDA; GPU; MPI; MPI-ACC; OpenCL;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems (HPCC-ICESS), 2012 IEEE 14th International Conference on

Conference_Location :

Liverpool

Print_ISBN :

978-1-4673-2164-8

Type :

conf

DOI :

10.1109/HPCC.2012.92

Filename :

6332231

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1827894