DocumentCode :
141855
Title :
Parallel join executions in RAMCloud
Author :
Tinnefeld, Christian ; Kossmann, D. ; Boese, Joos-Hendrik ; Plattner, Hasso
Author_Institution :
Hasso Plattner Inst., Univ. of Potsdam, Potsdam, Germany
fYear :
2014
fDate :
March 31 2014-April 4 2014
Firstpage :
182
Lastpage :
190
Abstract :
Modern large-scale storage systems provide not only storage capacity, but also processing power. When such a storage system serves as persistence for a database application, it is desirable to utilize its processing power for supporting query execution. In this paper, we evaluate the parallel execution of join operations in Stanford's RAMCloud, a DRAM-based storage system connected via RDMA-enabled network adapters. We (a) provide a system model to derive the execution costs for the Grace Join, the Distributed Block Nested Loop Join, and the Cyclo Join algorithms and their corresponding implementations in RAMCloud; (b) describe how the execution time for a single join operation depends on factors such as relation sizes, the number of nodes used for a join, and the chosen algorithm; and (c) introduce and evaluate a set of heuristics for parameterizing the execution of many join operations in parallel with the goal of maximizing throughput.
Keywords :
DRAM chips; cloud computing; distributed algorithms; parallel processing; query processing; storage management; DRAM-based storage system; RDMA-enabled network adapters; Stanford RAMCloud; cyclo join algorithm; database application; distributed block nested loop join algorithm; grace join algorithm; heuristics; large-scale storage systems; parallel join executions; query execution; Art; Data models; Data transfer; Distributed databases; Partitioning algorithms; Silicon;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Data Engineering Workshops (ICDEW), 2014 IEEE 30th International Conference on
Conference_Location :
Chicago, IL
Type :
conf
DOI :
10.1109/ICDEW.2014.6818325
Filename :
6818325