DocumentCode :
257505
Title :
GPU-in-Hadoop: Enabling MapReduce across distributed heterogeneous platforms
Author :
Jie Zhu ; Juanjuan Li ; Hardesty, Erikson ; Hai Jiang ; Kuan-Ching Li
Author_Institution :
Dept. of Comput. Sci., Arkansas State Univ., Jonesboro, AR, USA
fYear :
2014
fDate :
4-6 June 2014
Firstpage :
321
Lastpage :
326
Abstract :
As the size of high performance applications increases, four major challenges including heterogeneity, programmability, failure resilience, and energy efficiency have arisen in the underlying distributed systems. To tackle with all of them without sacrificing performance, traditional approaches in resource utilization, task scheduling and programming paradigm should be reconsidered. As Hadoop has handled data-intensive applications well in Clouds, GPU has demonstrated its acceleration effectiveness for computation-intensive ones. This paper intends to integrate Hadoop with CUDA to exploit both CPU and GPU resources. Hadoop will schedule MapReduce´s Map and Reduce functions across multiple nodes, whereas CUDA code helps accelerate them further on local GPUs. All available heterogeneous computational power will be utilized. MapReduce in Hadoop will ease the programming task by hiding communication details. Hadoop Distributed File System will help achieve data-level fault resilience. GPU´s energy efficiency characteristics help reduce the power consumption of the whole system. To achieve Hadoop and GPU integration, four approaches including Jcuda, JNI, Hadoop Streaming, and Hadoop Pipes, have been accomplished. Experimental results have demonstrated their effectiveness.
Keywords :
cloud computing; parallel architectures; parallel programming; power aware computing; resource allocation; software fault tolerance; CPU resource; CUDA code; GPU integration; GPU resource; GPU-in-Hadoop; Hadoop distributed file system; Hadoop pipes; Hadoop streaming; JNI; Jcuda; MapReduce; clouds; data-intensive applications; data-level fault resilience; distributed heterogeneous platforms; distributed systems; energy efficiency characteristics; failure resilience; heterogeneous computational power; high performance applications; local GPUs; map function scheduling; power consumption reduction; programming paradigm; programming task; reduce function scheduling; resource utilization; task scheduling; Graphics processing units; Instruction sets; Java; Kernel; Libraries; Programming; Runtime; CUDA; Hadoop; MapReduce;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Science (ICIS), 2014 IEEE/ACIS 13th International Conference on
Conference_Location :
Taiyuan
Type :
conf
DOI :
10.1109/ICIS.2014.6912154
Filename :
6912154
Link To Document :
بازگشت