DocumentCode :
3090057
Title :
The Case for Evaluating MapReduce Performance Using Workload Suites
Author :
Chen, Yanpei ; Ganapathi, Archana ; Griffith, Rean ; Katz, Randy
Author_Institution :
Univ. of California, Berkeley, CA, USA
fYear :
2011
fDate :
25-27 July 2011
Firstpage :
390
Lastpage :
399
Abstract :
MapReduce systems face enormous challenges due to increasing growth, diversity, and consolidation of the data and computation involved. Provisioning, configuring, and managing large-scale MapReduce clusters require realistic, workload-specific performance insights that existing MapReduce benchmarks are ill-equipped to supply. In this paper, we build the case for going beyond benchmarks for MapReduce performance evaluations. We analyze and compare two production MapReduce traces to develop a vocabulary for describing MapReduce workloads. We show that existing benchmarks fail to capture rich workload characteristics observed in traces, and propose a framework to synthesize and execute representative workloads. We demonstrate that performance evaluations using realistic workloads gives cluster operator new ways to identify workload-specific resource bottlenecks, and workload-specific choice of MapReduce task schedulers. We expect that once available, workload suites would allow cluster operators to accomplish previously challenging tasks beyond what we can now imagine, thus serving as a useful tool to help design and manage MapReduce systems.
Keywords :
data handling; parallel processing; pattern clustering; performance evaluation; processor scheduling; resource allocation; MapReduce benchmark; MapReduce performance evaluation; MapReduce task scheduler; MapReduce workload; large-scale MapReduce cluster management; workload suites; workload-specific performance; workload-specific resource bottlenecks; Aggregates; Benchmark testing; Facebook; Measurement; Production; Transforms; Vocabulary; MapReduce; benchmark; performance; workload;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Modeling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS), 2011 IEEE 19th International Symposium on
Conference_Location :
Singapore
ISSN :
1526-7539
Print_ISBN :
978-1-4577-0468-0
Type :
conf
DOI :
10.1109/MASCOTS.2011.12
Filename :
6005383
Link To Document :
بازگشت