DocumentCode :
79922
Title :
NUMA-Aware Scalable and Efficient In-Memory Aggregation on Large Domains
Author :
Li Wang ; Minqi Zhou ; Zhenjie Zhang ; Ming-Chien Shan ; Aoying Zhou
Author_Institution :
Software Eng. Inst., East China Normal Univ., Shanghai, China
Volume :
27
Issue :
4
fYear :
2015
fDate :
April 1 2015
Firstpage :
1071
Lastpage :
1084
Abstract :
Business Intelligence (BI) is recognized as one of the most important IT applications in the coming big data era. In recent years, non-uniform memory access (NUMA) has become the de-facto architecture of multiprocessors on the new generation of enterprise servers. Such new architecture brings new challenges to optimization techniques on traditional operators in BI. Aggregation, for example, is one of the basic building blocks of BI, while its processing performance with existing hash-based algorithms scales poorly in terms of the number of cores under NUMA architecture. In this paper, we provide new solutions to tackle the problem of parallel hash-based aggregation, especially targeting at domains of extremely large cardinality. We propose a NUMA-aware radix partitioning (NaRP) method which divides the original huge relation table into subsets, without invoking expensive remote memory access between nodes of the cores. We also present a new efficient aggregation algorithm (EAA), to aggregate the partitioned data in parallel with low cache coherence miss and locking costs. Theoretical analysis as well as empirical study on an IBM X5 server prove that our proposals are at least two times faster than existing methods.
Keywords :
Big Data; cache storage; competitive intelligence; cryptography; memory architecture; optimisation; parallel processing; BI; Big Data; EAA; IBM X5 server; IT applications; NUMA architecture; NUMA-aware radix partitioning method; NUMA-aware scalable aggregation; NaRP method; business intelligence; cache coherence miss; data partitioning; de-facto architecture; efficient aggregation algorithm; enterprise servers; hash-based algorithms; in-memory aggregation; locking costs; multiprocessors; nonuniform memory access; optimization techniques; parallel hash-based aggregation; Bismuth; Coherence; Multicore processing; Partitioning algorithms; Program processors; Servers; Aggregation; cache miss; in-memory databases; radix-partitioning;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2014.2359675
Filename :
6906264
Link To Document :
بازگشت