DocumentCode
79922
Title
NUMA-Aware Scalable and Efficient In-Memory Aggregation on Large Domains
Author
Li Wang ; Minqi Zhou ; Zhenjie Zhang ; Ming-Chien Shan ; Aoying Zhou
Author_Institution
Software Eng. Inst., East China Normal Univ., Shanghai, China
Volume
27
Issue
4
fYear
2015
fDate
April 1 2015
Firstpage
1071
Lastpage
1084
Abstract
Business Intelligence (BI) is recognized as one of the most important IT applications in the coming big data era. In recent years, non-uniform memory access (NUMA) has become the de-facto architecture of multiprocessors on the new generation of enterprise servers. Such new architecture brings new challenges to optimization techniques on traditional operators in BI. Aggregation, for example, is one of the basic building blocks of BI, while its processing performance with existing hash-based algorithms scales poorly in terms of the number of cores under NUMA architecture. In this paper, we provide new solutions to tackle the problem of parallel hash-based aggregation, especially targeting at domains of extremely large cardinality. We propose a NUMA-aware radix partitioning (NaRP) method which divides the original huge relation table into subsets, without invoking expensive remote memory access between nodes of the cores. We also present a new efficient aggregation algorithm (EAA), to aggregate the partitioned data in parallel with low cache coherence miss and locking costs. Theoretical analysis as well as empirical study on an IBM X5 server prove that our proposals are at least two times faster than existing methods.
Keywords
Big Data; cache storage; competitive intelligence; cryptography; memory architecture; optimisation; parallel processing; BI; Big Data; EAA; IBM X5 server; IT applications; NUMA architecture; NUMA-aware radix partitioning method; NUMA-aware scalable aggregation; NaRP method; business intelligence; cache coherence miss; data partitioning; de-facto architecture; efficient aggregation algorithm; enterprise servers; hash-based algorithms; in-memory aggregation; locking costs; multiprocessors; nonuniform memory access; optimization techniques; parallel hash-based aggregation; Bismuth; Coherence; Multicore processing; Partitioning algorithms; Program processors; Servers; Aggregation; cache miss; in-memory databases; radix-partitioning;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2014.2359675
Filename
6906264
Link To Document