DocumentCode :
2441374
Title :
High performance comparison-based sorting algorithm on many-core GPUs
Author :
Ye, Xiaochun ; Fan, Dongrui ; Lin, Wei ; Yuan, Nan ; Ienne, Paolo
Author_Institution :
Key Lab. of Comput. Syst. & Archit., Chinese Acad. of Sci., Beijing, China
fYear :
2010
fDate :
19-23 April 2010
Firstpage :
1
Lastpage :
10
Abstract :
Sorting is a kernel algorithm for a wide range of applications. In this paper, we present a new algorithm, GPU-Warpsort, to perform comparison-based parallel sort on Graphics Processing Units (GPUs). It mainly consists of a bitonic sort followed by a merge sort. Our algorithm achieves high performance by efficiently mapping the sorting tasks to GPU architectures. Firstly, we take advantage of the synchronous execution of threads in a warp to eliminate the barriers in bitonic sorting network. We also provide sufficient homogeneous parallel operations for all the threads within a warp to avoid branch divergence. Furthermore, we implement the merge sort efficiently by assigning each warp independent pairs of sequences to be merged and by exploiting totally coalesced global memory accesses to eliminate the bandwidth bottleneck. Our experimental results indicate that GPU-Warpsort works well on different kinds of input distributions, and it achieves up to 30% higher performance than previous optimized comparison-based GPU sorting algorithm on input sequences with millions of elements.
Keywords :
computer graphic equipment; coprocessors; parallel architectures; sorting; GPU architecture; GPU-Warpsort; bitonic sorting network; comparison-based parallel sort; graphics processing units; high performance comparison; homogeneous parallel operation; kernel algorithm; many-core GPU; merge sort; sorting algorithm; synchronous execution; Bitonic Network; CUDA; GPU; Many-Core; Merge Sort; Sorting Algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel & Distributed Processing (IPDPS), 2010 IEEE International Symposium on
Conference_Location :
Atlanta, GA
ISSN :
1530-2075
Print_ISBN :
978-1-4244-6442-5
Type :
conf
DOI :
10.1109/IPDPS.2010.5470445
Filename :
5470445
Link To Document :
بازگشت