An Analysis of SMP Memory Allocators: MapReduce on Large Shared-Memory Systems

Author

Döbbelin, Robert ; Schütt, Thorsten ; Reinefeld, Alexander

fYear

2012

fDate

10-13 Sept. 2012

Firstpage

48

Lastpage

54

Abstract

The choice of a suitable memory allocation strategy greatly affects the performance of data-intensive applications on large shared-memory systems (SMPs). Standard memory allocators often provide poor performance because they do not properly reflect the different memory access latencies in deep NUMA architectures with their on-chip, off-chip, and off-blade communication. We analyze memory allocation strategies for data-intensive MapReduce applications on a large SMP with 512 cores and 2~TB main memory. We compare the efficiency of the MapReduce frameworks MR-Search and Phoenix++ and provide performance results on two benchmark applications, k-means and shortest-path search. Already on small SMPs with 128 cores a 6-fold speedup can be achieved by substituting the standard glibc by a better adapted memory allocation strategy, and these savings become more pronounced on larger SMPs. We identify two types of overhead: (1) the cost for executing the allocation requests and (2) poor memory locality caused by inefficient mapping to the underlying memory topology. We give detailed results on the NUMA traffic and show how the cost increases on large SMPs with many cores and a deep NUMA hierarchy.

Keywords

parallel processing; search problems; shared memory systems; storage allocation; storage management; MR-Search; MapReduce frameworks; NUMA traffic; Phoenix++; SMP memory allocator analysis; allocation request; data-intensive MapReduce application; data-intensive application; deep NUMA architecture; deep NUMA hierarchy; k-means search; large shared-memory system; memory access latency; memory allocation strategy; memory locality; memory topology; off-blade communication; off-chip communication; on-chip communication; shortest-path search; Arrays; Blades; Containers; Instruction sets; Memory management; Resource management; Vectors; NUMA; mapreduce; memory mangement;

fLanguage

English

Publisher

ieee

Conference_Titel

Parallel Processing Workshops (ICPPW), 2012 41st International Conference on

Conference_Location

Pittsburgh, PA

ISSN

1530-2016

Print_ISBN

978-1-4673-2509-7

Type

conf

DOI

10.1109/ICPPW.2012.10

Filename

6337462