DocumentCode :
2906792
Title :
Design and Implementation of MapReduce Using the PGAS Programming Model with UPC
Author :
Teijeiro, Carlos ; Taboada, Guillermo L. ; Touriño, Juan ; Doallo, Ramón
Author_Institution :
Dept. of Electron. & Syst., Univ. of A Coruña, A Coruña, Spain
fYear :
2011
fDate :
7-9 Dec. 2011
Firstpage :
196
Lastpage :
203
Abstract :
MapReduce is a powerful tool for processing large data sets and is used by many applications running in distributed environments. However, despite the increasing number of computationally intensive problems that require low-latency communications, the adoption of MapReduce in High Performance Computing (HPC) is still emerging. In this context, languages based on the Partitioned Global Address Space (PGAS) programming model have been shown to be a good choice for implementing parallel applications that take advantage of the increasing number of cores per node and of the programmability benefits of their global memory view, such as transparent access to remote data. This paper presents the first PGAS-based MapReduce implementation that uses the Unified Parallel C (UPC) language, which (1) obtains programmability benefits in parallel programming, (2) offers advanced configuration options to define a customized load distribution for different codes, and (3) overcomes performance penalties and bottlenecks that have traditionally prevented the deployment of MapReduce applications in HPC. The performance evaluation of representative applications on shared and distributed memory environments assesses the scalability of the presented MapReduce framework, confirming its suitability.
Keywords :
C language; distributed shared memory systems; parallel languages; parallel programming; software performance evaluation; MapReduce; Unified Parallel C language; customized load distribution; distributed environments; distributed memory environments; high performance computing; parallel programming; partitioned global address space programming model; performance evaluation; shared memory environments; Electronics packaging; Instruction sets; Java; Libraries; Multicore processing; Programming; HPC; MapReduce; UPC; collective primitives; programmability;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Parallel and Distributed Systems (ICPADS), 2011 IEEE 17th International Conference on
Conference_Location :
Tainan
ISSN :
1521-9097
Print_ISBN :
978-1-4577-1875-5
Type :
conf
DOI :
10.1109/ICPADS.2011.162
Filename :
6121278