Title :
Benchmarking Dependability of MapReduce Systems
Author :
Sangroya, Amit ; Serrano, D. ; Bouchenak, S.
Author_Institution :
LIG, INRIA, Grenoble, France
Abstract :
MapReduce is a popular programming model for distributed data processing. Extensive research has been conducted on the reliability of MapReduce, ranging from adaptive and on-demand fault-tolerance to new fault-tolerance models. However, realistic benchmarks for analyzing and comparing the effectiveness of these proposals are still missing. To date, most MapReduce fault-tolerance solutions have been evaluated using micro benchmarks in ad-hoc and overly simplified settings, which may not be representative of real-world applications. This paper presents MRBS, a comprehensive benchmark suite for evaluating the dependability of MapReduce systems. MRBS includes five benchmarks covering several application domains and a wide range of execution scenarios, such as data-intensive vs. compute-intensive applications and batch applications vs. online interactive applications. MRBS supports injecting various types of faults at different rates. It also considers different application workloads and data loads, and produces extensive reliability, availability and performance statistics. We illustrate the use of MRBS with Hadoop clusters running on Amazon EC2 and on a private cloud.
Keywords :
cloud computing; software fault tolerance; software performance evaluation; Amazon EC2; Hadoop clusters; MRBS; MapReduce reliability; MapReduce systems; adaptive fault-tolerance model; application data loads; application workloads; batch applications; benchmarking dependability; compute-intensive applications; data-intensive applications; distributed data processing; micro benchmarks; on-demand fault-tolerance model; online interactive applications; performance statistics; private cloud; programming model; Benchmark testing; Cloud computing; Computer crashes; Fault tolerance; Fault tolerant systems; Motion pictures; Benchmark; Dependability; Hadoop; MapReduce;
Conference_Title :
Reliable Distributed Systems (SRDS), 2012 IEEE 31st Symposium on
Conference_Location :
Irvine, CA
Print_ISBN :
978-1-4673-2397-0
DOI :
10.1109/SRDS.2012.12