DocumentCode
1917659
Title
An Analysis of SMP Memory Allocators: MapReduce on Large Shared-Memory Systems
Author
Döbbelin, Robert ; Schütt, Thorsten ; Reinefeld, Alexander
fYear
2012
fDate
10-13 Sept. 2012
Firstpage
48
Lastpage
54
Abstract
The choice of a suitable memory allocation strategy greatly affects the performance of data-intensive applications on large shared-memory systems (SMPs). Standard memory allocators often provide poor performance because they do not properly reflect the different memory access latencies in deep NUMA architectures with their on-chip, off-chip, and off-blade communication. We analyze memory allocation strategies for data-intensive MapReduce applications on a large SMP with 512 cores and 2~TB main memory. We compare the efficiency of the MapReduce frameworks MR-Search and Phoenix++ and provide performance results on two benchmark applications, k-means and shortest-path search. Already on small SMPs with 128 cores a 6-fold speedup can be achieved by substituting the standard glibc by a better adapted memory allocation strategy, and these savings become more pronounced on larger SMPs. We identify two types of overhead: (1) the cost for executing the allocation requests and (2) poor memory locality caused by inefficient mapping to the underlying memory topology. We give detailed results on the NUMA traffic and show how the cost increases on large SMPs with many cores and a deep NUMA hierarchy.
Keywords
parallel processing; search problems; shared memory systems; storage allocation; storage management; MR-Search; MapReduce frameworks; NUMA traffic; Phoenix++; SMP memory allocator analysis; allocation request; data-intensive MapReduce application; data-intensive application; deep NUMA architecture; deep NUMA hierarchy; k-means search; large shared-memory system; memory access latency; memory allocation strategy; memory locality; memory topology; off-blade communication; off-chip communication; on-chip communication; shortest-path search; Arrays; Blades; Containers; Instruction sets; Memory management; Resource management; Vectors; NUMA; mapreduce; memory mangement;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing Workshops (ICPPW), 2012 41st International Conference on
Conference_Location
Pittsburgh, PA
ISSN
1530-2016
Print_ISBN
978-1-4673-2509-7
Type
conf
DOI
10.1109/ICPPW.2012.10
Filename
6337462
Link To Document