DocumentCode
263697
Title
A Heuristic Speculative Execution Strategy in Heterogeneous Distributed Environments
Author
Huicheng Wu ; Kenli Li ; Zhuo Tang ; Longxin Zhang
Author_Institution
Coll. of Inf. Sci. & Eng., Hunan Univ., Changsha, China
fYear
2014
fDate
13-15 July 2014
Firstpage
268
Lastpage
273
Abstract
MapReduce is a distributed parallel computing framework for large-scale data processing with extensive applications. Hadoop MapReduce is the most widely employed open-source implementation of MapReduce framework for its flexible customization and simple usage. To avoid the relatively slow running task, called a straggler task, slowing down the job, MapReduce speculatively backups the straggler task on another node to execute aiming to reduce the job´s finish time. Although there have been many speculative execution strate-gies in heterogeneous environments, they all do not consider the impact of dynamic system load on the running time of tasks. They may make mistakes in determining stragglers. In our paper, we propose a novel speculative execution strategy in heterogeneous environments, ERUL, to im-prove the estimation of tasks´ rest time. ERUL also overcomes some drawbacks of LATE that mislead the speculative execution in some cases. The experimental result indicates that, our Hadoop-ERUL strategy not only works more accurately in the estimation of running tasks´ remaining execution time, but also reduces 26% job´s running time compared to Hadoop-LATE.
Keywords
cloud computing; parallel processing; public domain software; Hadoop MapReduce; Hadoop-ERUL strategy; Hadoop-LATE; MapReduce framework; cloud computing; distributed parallel computing framework; dynamic system load; heterogeneous distributed environments; heterogeneous environment; large-scale data processing; open-source implementation; speculative execution strategy; straggler task; Data models; Distributed databases; Educational institutions; Estimation; Heuristic algorithms; Load modeling; Open source software; MapReduce; cloud computing; dynamic loading; hadoop; speculative execution;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Architectures, Algorithms and Programming (PAAP), 2014 Sixth International Symposium on
Conference_Location
Beijing
ISSN
2168-3034
Print_ISBN
978-1-4799-3844-5
Type
conf
DOI
10.1109/PAAP.2014.29
Filename
6916476
Link To Document