DocumentCode :
3740662
Title :
Dominoes: Speculative Repair in Erasure-Coded Hadoop System
Author :
Xi Yang;Chen Feng;Zhiwei Xu;Xian-He Sun
Author_Institution :
Dept. of Comput. Sci., Illinois Inst. of Technol., Chicago, IL, USA
fYear :
2015
Firstpage :
366
Lastpage :
375
Abstract :
Data volume is growing dramatically in the era of big data. To save capital cost on storage hardware, datacenters now prefer erasure coding to simple replication for protecting against data loss. Erasure coding can provide fault tolerance equivalent to HDFS's default three-way replication mechanism, but it degrades data availability for task scheduling. In an erasure-coded system, tasks that access missing blocks during MapReduce job processing must pay the data reconstruction time. Tasks that access corrupt data become stragglers and degrade resource utilization. To overcome these challenges, we propose a novel mechanism, Dominoes, that coordinates lightweight data-state checking with job scheduling to hide this recovery penalty during job processing and improve job throughput. The experimental results confirm Dominoes' effectiveness and efficiency: it improves job throughput by 9% to 9.7% under failure, at an overhead of 2.6% for failure-free jobs.
Keywords :
"Encoding","Maintenance engineering","Facebook","Metadata","Throughput","Production","Schedules"
Publisher :
IEEE
Conference_Title :
High Performance Computing (HiPC), 2015 IEEE 22nd International Conference on
Type :
conf
DOI :
10.1109/HiPC.2015.39
Filename :
7397652