• DocumentCode
    3245224
  • Title

    Accelerating MapReduce with Distributed Memory Cache

  • Author

    Zhang, Shubin ; Han, Jizhong ; Liu, Zhiyong ; Wang, Kai ; Feng, Shengzhong

  • Author_Institution
    Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing, China
  • fYear
    2009
  • fDate
    8-11 Dec. 2009
  • Firstpage
    472
  • Lastpage
    478
  • Abstract
    MapReduce is a partition-based parallel programming model and framework enabling easy development of scalable parallel programs on clusters of commodity machines. In order to make time-intensive applications benefit from MapReduce on small scale clusters, this paper proposes a new method to improve the performance of MapReduce by using distributed memory cache as a high speed access between map tasks and reduce tasks. Map outputs sent to the distributed memory cache can be gotten by reduce tasks as soon as possible. Experiment results show that our prototype´s performance is much better than that of the original on small scale clusters. To our knowledge, this is the first effort to accelerate MapReduce with the help of distributed memory cache.
  • Keywords
    cache storage; distributed memory systems; parallel programming; MapReduce; distributed memory cache; high speed access; map tasks; partition-based parallel programming model; reduce tasks; scalable parallel programs; Acceleration; Computers; Concurrent computing; Delay; Distributed computing; Fault tolerance; Large-scale systems; Parallel programming; Programming profession; Prototypes; MapReduce; cluster computing; distributed memory cache; high speed access;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Systems (ICPADS), 2009 15th International Conference on
  • Conference_Location
    Shenzhen
  • ISSN
    1521-9097
  • Print_ISBN
    978-1-4244-5788-5
  • Type

    conf

  • DOI
    10.1109/ICPADS.2009.88
  • Filename
    5395321