Title :
Parallel lossless data compression on the GPU
Author :
Patel, Ritesh A. ; Zhang, Yao ; Mak, Jason ; Davidson, Andrew ; Owens, John D.
Author_Institution :
Univ. of California, Davis, Davis, CA, USA
Abstract :
We present parallel algorithms and implementations of a bzip2-like lossless data compression scheme for GPU architectures. Our approach parallelizes three main stages in the bzip2 compression pipeline: Burrows-Wheeler transform (BWT), move-to-front transform (MTF), and Huffman coding. In particular, we utilize a two-level hierarchical sort for BWT, design a novel scan-based parallel MTF algorithm, and implement a parallel reduction scheme to build the Huffman tree. For each algorithm, we perform detailed performance analysis, discuss its strengths and weaknesses, and suggest future directions for improvements. Overall, our GPU implementation is dominated by BWT performance and is 2.78× slower than bzip2, with BWT and MTF-Huffman respectively 2.89× and 1.34× slower on average.
Keywords :
Huffman codes; data compression; graphics processing units; parallel algorithms; telecommunication computing; trees (mathematics); BWT; Burrows-Wheeler transform; GPU architectures; Huffman coding; Huffman tree; MTF; bzip2-like lossless data compression scheme; move-to-front transform; parallel lossless data compression; parallel reduction scheme; scan-based parallel MTF algorithm; Arrays; Encoding; Graphics processing unit; Histograms; Humans; Instruction sets; Transforms;
Conference_Titel :
Innovative Parallel Computing (InPar), 2012
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4673-2632-2
Electronic_ISBN :
978-1-4673-2631-5
DOI :
10.1109/InPar.2012.6339599