DocumentCode :
2447407
Title :
Hybrid Map Task Scheduling for GPU-Based Heterogeneous Clusters
Author :
Shirahata, Koichi ; Sato, Hitoshi ; Matsuoka, Satoshi
Author_Institution :
Tokyo Inst. of Technol., Tokyo, Japan
fYear :
2010
fDate :
Nov. 30 2010-Dec. 3 2010
Firstpage :
733
Lastpage :
740
Abstract :
MapReduce is a programming model that enables efficient massive data processing in large-scale computing environments such as supercomputers and clouds. Such large-scale computers employ GPUs to enjoy its good peak performance and high memory bandwidth. Since the performance of each job is depending on running application characteristics and underlying computing environments, scheduling MapReduce tasks onto CPU cores and GPU devices for efficient execution is difficult. To address this problem, we have proposed a hybrid scheduling technique for GPU-based computer clusters, which minimizes the execution time of a submitted job using dynamic profiles of Map tasks running on CPU cores and GPU devices. We have implemented a prototype of our proposed scheduling technique by extending MapReduce framework, Hadoop. We have conducted some experiments for this prototype by using a K-means application as a benchmark on a supercomputer. The results show that the proposed technique achieves 1.93 times faster than the Hadoop original scheduling algorithm at 64 nodes (1024 CPU cores and 128 GPU devices). The results also indicate that the performance of map tasks, including both CPU and GPU tasks, is significantly affected by the overhead of map task invocation in the Hadoop framework.
Keywords :
computer graphic equipment; coprocessors; microcomputers; parallel machines; pattern clustering; GPU-based computer clusters; Hadoop original scheduling algorithm; K-means application; MapReduce model; data processing; hybrid map task scheduling; hybrid scheduling technique; memory bandwidth; programming model; supercomputer; Central Processing Unit; Computers; Graphics processing unit; Java; Performance evaluation; Processor scheduling; Prototypes; GPGPU; Job Scheduling; Large-scale data processing; MapReduce;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cloud Computing Technology and Science (CloudCom), 2010 IEEE Second International Conference on
Conference_Location :
Indianapolis, IN
Print_ISBN :
978-1-4244-9405-7
Electronic_ISBN :
978-0-7695-4302-4
Type :
conf
DOI :
10.1109/CloudCom.2010.55
Filename :
5708524
Link To Document :
بازگشت