Title :
Cogset vs. Hadoop: Measurements and Analysis
Author :
Valvåg, Steffen Viken ; Johansen, Dag ; Kvalnes, Åge
Author_Institution :
Dept. of Comput. Sci., Univ. of Tromso, Tromsø, Norway
fDate :
Nov. 30 2010-Dec. 3 2010
Abstract :
Cogset is an efficient and generic engine for reliable storage and parallel processing of data. It supports a number of high-level programming interfaces, including a MapReduce interface compatible with Hadoop. In this paper, we evaluate Cogset´s performance as a MapReduce engine, comparing it to Hadoop. Our results show that Cog set generally outperforms Hadoop by a significant margin. We investigate the causes of this gap in performance and demonstrate some relatively minor modifications that markedly improveHadoop´s performance, closing some of the gap.
Keywords :
file organisation; parallel processing; Cogset; Hadoop; MapReduce interface; high-level programming interfaces; parallel processing; reliable data storage; Aggregates; Benchmark testing; Engines; IP networks; Indexes; Optimization; Benchmark; MapReduce; Optimization;
Conference_Titel :
Cloud Computing Technology and Science (CloudCom), 2010 IEEE Second International Conference on
Conference_Location :
Indianapolis, IN
Print_ISBN :
978-1-4244-9405-7
Electronic_ISBN :
978-0-7695-4302-4
DOI :
10.1109/CloudCom.2010.103