• DocumentCode
    1917659
  • Title

    An Analysis of SMP Memory Allocators: MapReduce on Large Shared-Memory Systems

  • Author

    Döbbelin, Robert ; Schütt, Thorsten ; Reinefeld, Alexander

  • fYear
    2012
  • fDate
    10-13 Sept. 2012
  • Firstpage
    48
  • Lastpage
    54
  • Abstract
    The choice of a suitable memory allocation strategy greatly affects the performance of data-intensive applications on large shared-memory systems (SMPs). Standard memory allocators often provide poor performance because they do not properly reflect the different memory access latencies in deep NUMA architectures with their on-chip, off-chip, and off-blade communication. We analyze memory allocation strategies for data-intensive MapReduce applications on a large SMP with 512 cores and 2~TB main memory. We compare the efficiency of the MapReduce frameworks MR-Search and Phoenix++ and provide performance results on two benchmark applications, k-means and shortest-path search. Already on small SMPs with 128 cores a 6-fold speedup can be achieved by substituting the standard glibc by a better adapted memory allocation strategy, and these savings become more pronounced on larger SMPs. We identify two types of overhead: (1) the cost for executing the allocation requests and (2) poor memory locality caused by inefficient mapping to the underlying memory topology. We give detailed results on the NUMA traffic and show how the cost increases on large SMPs with many cores and a deep NUMA hierarchy.
  • Keywords
    parallel processing; search problems; shared memory systems; storage allocation; storage management; MR-Search; MapReduce frameworks; NUMA traffic; Phoenix++; SMP memory allocator analysis; allocation request; data-intensive MapReduce application; data-intensive application; deep NUMA architecture; deep NUMA hierarchy; k-means search; large shared-memory system; memory access latency; memory allocation strategy; memory locality; memory topology; off-blade communication; off-chip communication; on-chip communication; shortest-path search; Arrays; Blades; Containers; Instruction sets; Memory management; Resource management; Vectors; NUMA; mapreduce; memory mangement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing Workshops (ICPPW), 2012 41st International Conference on
  • Conference_Location
    Pittsburgh, PA
  • ISSN
    1530-2016
  • Print_ISBN
    978-1-4673-2509-7
  • Type

    conf

  • DOI
    10.1109/ICPPW.2012.10
  • Filename
    6337462