DocumentCode :
3462304
Title :
Scheduling Divisible Loads from Multiple Input Sources in MapReduce
Author :
Tao Gu ; Qun Liao ; Yulu Yang ; Tao Li
Author_Institution :
Coll. of Inf. Tech. Sci., Nankai Univ., Tianjin, China
fYear :
2013
fDate :
3-5 Dec. 2013
Firstpage :
1263
Lastpage :
1270
Abstract :
MapReduce is a programming model for data-intensive applications. The performance of MapReduce is greatly affected by the scheduling efficiency, which needs to consider both the parallelism of processing and overhead of data transmission. Lots of researches only focus on optimizing scheduling mechanisms or data storage strategies to reduce the overhead of data transmission in big data applications. The analysis of MapReduce carried out doesn´t consider the overhead of data transmission sufficiently. This paper proposes a divisible load scheduling model to describe the data transmission and task execution in MapReduce. The task allocation and corresponding data transmission are abstracted as scheduling divisible loads from multiple input sources. With the established model, the optimal distribution of input data and scheduling of loads in map phase are presented with linear programming. The performance of map phase is evaluated and analyzed under different environments.
Keywords :
Big Data; linear programming; parallel processing; resource allocation; storage management; Big Data applications; MapReduce; data storage strategies; data transmission overhead; data-intensive applications; divisible load scheduling model; linear programming; map phase performance; optimal data distribution; parallelism; programming model; scheduling efficiency; scheduling mechanisms; task allocation; task execution; Data communication; Data models; Equations; Linear programming; Load modeling; Processor scheduling; Silicon; Divisible Load Model; MapReduce; Multiple Input Sources; Performance Evaluation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Science and Engineering (CSE), 2013 IEEE 16th International Conference on
Conference_Location :
Sydney, NSW
Type :
conf
DOI :
10.1109/CSE.2013.188
Filename :
6755370
Link To Document :
بازگشت