Title :
Efficient many-core query execution in main memory column-stores
Author :
Dees, J. ; Sanders, P.
Author_Institution :
SAP AG, Walldorf, Germany
Abstract :
We use the full query set of the TPC-H Benchmark as a case study for the efficient implementation of decision support queries on main memory column-store databases. Instead of splitting a query into separate independent operators, we consider the query as a whole and translate the execution plan into a single function performing the query. This allows highly efficient CPU utilization, minimal materialization, and execution in a single pass over the data for most queries. The single pass is performed in parallel and scales near-linearly with the number of cores. The resulting query plans for most of the 22 queries are remarkably simple and are suited for automatic generation and fast compilation. Using a data-parallel, NUMA-aware many-core implementation with block summaries, inverted index data structures, and efficient aggregation algorithms, we achieve one to two orders of magnitude better performance than the current record holders of the TPC-H Benchmark.
Keywords :
data structures; decision support systems; multiprocessing systems; query processing; storage management; CPU utilization; NUMA-aware many-core implementation; TPC-H benchmark; aggregation algorithms; automatic generation; block summary; compilation; data-parallel; decision support query; execution plan; inverted index data structures; main memory column-store databases; many-core query execution; memory column-stores; minimal materialization; query plans; query set; separate independent operators; single pass; Bandwidth; Benchmark testing; Data structures; Indexes; Instruction sets; Sockets;
Conference_Titel :
Data Engineering (ICDE), 2013 IEEE 29th International Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-4909-3
Electronic_ISBN :
1063-6382
DOI :
10.1109/ICDE.2013.6544838