Title :
Scalable parallelization of skyline computation for multi-core processors
Author :
Chester, Sean ; Sidlauskas, Darius ; Assent, Ira ; Bogh, Kenneth S.
Author_Institution :
Data-Intensive Syst. Group, Aarhus Univ., Aarhus, Denmark
Abstract :
The skyline is an important query operator for multi-criteria decision making. It reduces a dataset to only those points that offer optimal trade-offs of dimensions. In general, it is very expensive to compute. Recently, multicore CPU algorithms have been proposed to accelerate the computation of the skyline. However, they do not sufficiently minimize dominance tests and so are not competitive with state-of-the-art sequential algorithms. In this paper, we introduce a novel multicore skyline algorithm, Hybrid, which processes points in blocks. It maintains a shared, global skyline among all threads, which is used to minimize dominance tests while maintaining high throughput. The algorithm uses an efficiently-updatable data structure over the shared, global skyline, based on point-based partitioning. Also, we release a large benchmark of optimized skyline algorithms, with which we demonstrate on challenging workloads a 100-fold speedup over state-of-the-art multicore algorithms and a 10-fold speedup with 16 cores over state-of-the-art sequential algorithms.
Keywords :
multi-threading; multiprocessing systems; query processing; data structure; dominance test minimization; hybrid algorithm; multicore CPU algorithms; multicore processors; multicriteria decision making; optimized skyline algorithms; point-based partitioning; query operator; scalable parallelization; shared-global skyline; skyline computation; throughput; Data structures; Fuels; Parallel processing;
Conference_Titel :
Data Engineering (ICDE), 2015 IEEE 31st International Conference on
Conference_Location :
Seoul
DOI :
10.1109/ICDE.2015.7113358