Title :
MRBench: A Benchmark for MapReduce Framework
Author :
Kim, Kiyoung ; Jeon, Kyungho ; Han, Hyuck ; Kim, Shin-gyu ; Jung, Hyungsoo ; Yeom, Heon Y.
Author_Institution :
Sch. of Comput. Sci. & Eng., Seoul Nat. Univ., Seoul, South Korea
Abstract :
MapReduce is Google's programming model for easy development of scalable parallel applications that process huge quantities of data on many clusters. Because of its convenience and efficiency, MapReduce is used in various applications (e.g., Web search services and online analytical processing). However, there are only a few good benchmarks for evaluating MapReduce implementations with realistic test sets. In this paper, we present MRBench, a benchmark for evaluating MapReduce systems. MRBench focuses on processing business-oriented queries and concurrent data modifications. To this end, we build MRBench to deal with large volumes of relational data and execute highly complex queries. With MRBench, users can evaluate the performance of MapReduce systems while varying environmental parameters such as data size and the number of (map/reduce) tasks. Our extensive experimental results show that MRBench is a useful tool for benchmarking a system's ability to answer critical business questions.
Keywords :
parallel programming; pattern clustering; query processing; Google programming model; MapReduce framework; Web search services; business oriented queries; concurrent data modifications; critical business questions; online analytical processing; Application software; Benchmark testing; Computer science; Costs; Data engineering; Fault tolerance; Functional programming; Parallel processing; Parallel programming; Web search; Benchmark; MapReduce; TPC-H;
Conference_Title :
Parallel and Distributed Systems, 2008. ICPADS '08. 14th IEEE International Conference on
Conference_Location :
Melbourne, VIC
Print_ISBN :
978-0-7695-3434-3
DOI :
10.1109/ICPADS.2008.70