DocumentCode
3245224
Title
Accelerating MapReduce with Distributed Memory Cache
Author
Zhang, Shubin ; Han, Jizhong ; Liu, Zhiyong ; Wang, Kai ; Feng, Shengzhong
Author_Institution
Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing, China
fYear
2009
fDate
8-11 Dec. 2009
Firstpage
472
Lastpage
478
Abstract
MapReduce is a partition-based parallel programming model and framework enabling easy development of scalable parallel programs on clusters of commodity machines. In order to make time-intensive applications benefit from MapReduce on small scale clusters, this paper proposes a new method to improve the performance of MapReduce by using distributed memory cache as a high speed access between map tasks and reduce tasks. Map outputs sent to the distributed memory cache can be gotten by reduce tasks as soon as possible. Experiment results show that our prototype´s performance is much better than that of the original on small scale clusters. To our knowledge, this is the first effort to accelerate MapReduce with the help of distributed memory cache.
Keywords
cache storage; distributed memory systems; parallel programming; MapReduce; distributed memory cache; high speed access; map tasks; partition-based parallel programming model; reduce tasks; scalable parallel programs; Acceleration; Computers; Concurrent computing; Delay; Distributed computing; Fault tolerance; Large-scale systems; Parallel programming; Programming profession; Prototypes; MapReduce; cluster computing; distributed memory cache; high speed access;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Systems (ICPADS), 2009 15th International Conference on
Conference_Location
Shenzhen
ISSN
1521-9097
Print_ISBN
978-1-4244-5788-5
Type
conf
DOI
10.1109/ICPADS.2009.88
Filename
5395321
Link To Document