DocumentCode :
1919350
Title :
A CUDA-MPI Hybrid Bitonic Sorting Algorithm for GPU Clusters
Author :
White, Sam ; Verosky, Niels ; Newhall, Tia
Author_Institution :
Comput. Sci. Dept., Swarthmore Coll., Swarthmore, PA, USA
fYear :
2012
fDate :
10-13 Sept. 2012
Firstpage :
588
Lastpage :
589
Abstract :
We present a hybrid CUDA-MPI sorting algorithm that uses GPU clusters to sort large data sets. The algorithm has two phases. In the first phase, each node sorts its portion of the data on its GPU using a parallel bitonic sort. In the second phase, the sorted subsequences are merged in parallel using a reduction sorting network implemented in MPI across the cluster nodes. Compared with sequential quicksort, our algorithm achieves speed-ups of up to 9.8 when sorting 4 GB of data on a 32-node GPU cluster. We anticipate even better speed-ups on larger data sets and larger clusters.
Keywords :
application program interfaces; data reduction; graphics processing units; parallel architectures; pattern clustering; sorting; CUDA; GPU cluster; MPI; cluster node; data set; hybrid bitonic sorting algorithm; parallel bitonic sort; reduction sorting network; Algorithm design and analysis; Clustering algorithms; Graphics processing unit; Parallel processing; Random access memory; Runtime; Sorting; GPU clusters; hybrid CUDA-MPI; parallel sorting algorithm;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Parallel Processing Workshops (ICPPW), 2012 41st International Conference on
Conference_Location :
Pittsburgh, PA
ISSN :
1530-2016
Print_ISBN :
978-1-4673-2509-7
Type :
conf
DOI :
10.1109/ICPPW.2012.82
Filename :
6337530
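Illustrative sketch :
The two-phase scheme described in the abstract can be illustrated with a small sequential simulation. This is only a sketch under stated assumptions, not the authors' implementation: it contains no CUDA or MPI calls, the names `bitonic_sort`, `tree_merge`, and `hybrid_sort` are hypothetical, and an ordinary two-way merge (`heapq.merge`) stands in for the reduction sorting network, whose details the abstract does not give. It also assumes the data divides evenly among the nodes and that each node's share has a power-of-two length.

```python
import heapq
import random


def bitonic_sort(values):
    """Sort a list whose length is a power of two using the bitonic network."""
    a = list(values)
    n = len(a)
    if n & (n - 1):
        raise ValueError("length must be a power of two")
    k = 2
    while k <= n:                # size of the bitonic sequences being merged
        j = k // 2
        while j >= 1:            # compare-exchange distance for this step
            for i in range(n):   # on a GPU, each i could map to one thread
                partner = i ^ j
                if partner > i:
                    ascending = (i & k) == 0
                    # Swap when the pair is out of order for its direction.
                    if (a[i] > a[partner]) == ascending:
                        a[i], a[partner] = a[partner], a[i]
            j //= 2
        k *= 2
    return a


def tree_merge(sorted_chunks):
    """Merge sorted chunks pairwise, halving their count each round."""
    chunks = [list(c) for c in sorted_chunks]
    while len(chunks) > 1:
        merged = []
        for r in range(0, len(chunks), 2):
            if r + 1 < len(chunks):
                # Stand-in for two nodes exchanging and merging their data.
                merged.append(list(heapq.merge(chunks[r], chunks[r + 1])))
            else:
                merged.append(chunks[r])
        chunks = merged
    return chunks[0] if chunks else []


def hybrid_sort(data, num_nodes):
    """Phase 1: sort each node's share. Phase 2: merge across nodes."""
    share = len(data) // num_nodes
    parts = [data[r * share:(r + 1) * share] for r in range(num_nodes)]
    return tree_merge([bitonic_sort(p) for p in parts])


if __name__ == "__main__":
    sample = [random.randrange(1000) for _ in range(64)]
    print(hybrid_sort(sample, 4) == sorted(sample))
```

In a real cluster setting, the inner loop of `bitonic_sort` would run as parallel compare-exchange operations on each node's GPU, and each round of `tree_merge` would involve message passing between pairs of nodes.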