Title :
Multiple-Job Optimization in MapReduce for Heterogeneous Workloads
Author :
Hu, Weisong ; Tian, Chao ; Liu, Xiaowei ; Qi, Hongwei ; Zha, Li ; Liao, Huaming ; Zhang, Yuezhuo ; Zhang, Jie
Author_Institution :
NEC Labs. China, Beijing, China
Abstract :
Map Reduce cluster is emerging as a solution of data-intensive scalable computing system. The open source implementation Hadoop has already been adopted for building clusters containing thousands of nodes. Such cloud infrastructure was used to processing many different jobs depending on different hardware resources, such as memory, CPU, Disk I/O and Network I/O, simultaneously. If the schedule policy does not consider the heterogeneity of running jobs´ resource utilization types, resource contention may happen. In this paper, we analyze this multiple job parallelization problems in Map Reduce, and propose the multiple-job optimization (MJO) scheduler. Our scheduler detects job´s resource utilization type on the fly and improves the hardware utilization by parallel different kinds of jobs. We give two scenarios which are “same plan” and “same job” to illustrate the multiple jobs´ submission traces in Map Reduce clusters. Our experiments show that in these scenarios, MJO scheduler could save the make span by about 20%.
Keywords :
parallel processing; public domain software; resource allocation; scheduling; Hadoop open source software; MapReduce cluster; cloud infrastructure; data-intensive scalable computing system; multiple job parallelization problems; multiple-job optimization scheduler; resource contention; resource utilization; MapReduce; Mutiple job optimization; Schdule; heterogeneous workloads;
Conference_Titel :
Semantics Knowledge and Grid (SKG), 2010 Sixth International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-8125-5
Electronic_ISBN :
978-0-7695-4189-1
DOI :
10.1109/SKG.2010.23