Title :
High throughput compression of floating point numbers on graphical processing units
Author :
Padyana, A. ; Sudheer, C.D. ; Baruah, P.K. ; Srinivasan, A.
Author_Institution :
Dept. of Math. & Comput. Sci., Sri Sathya Sai Inst. of Higher Learning, Puttaparthi, India
Abstract :
Compute-intensive tasks in high-end high performance computing (HPC) systems often generate large amounts of data, especially floating-point data, that need to be transmitted over the network. Although computation speeds are very high, the overall performance of these applications is affected by the data transfer overhead. Moreover, as data sets are growing in size rapidly, bandwidth limitations pose a serious bottleneck in several scientific applications. Fast floating point compression can ameliorate the bandwidth limitations. If data is compressed well, then the amount of data transfer is reduced. This reduction in data transfer time comes at the expense of the increased computation required by compression and decompression. It is important for compression and decompression rates to be greater than the network bandwidth; otherwise, it will be faster to transmit uncompressed data directly. Accelerators such as Graphics Processing Units (GPU) provide much computational power. In this paper, we show that the computational power of GPUs can be harnessed to provide sufficiently fast compression and decompression for this approach to be effective for data produced by many practical applications. In particularly, we use Holt´s Exponential smoothing algorithm from time series analysis, and encode the difference between its predictions and the actual data. This yields a lossless compression scheme. We show that it can be implemented efficiently on GPUs to provide an effective compression scheme for the purpose of saving on data transfer overheads. The primary contribution of this work lies in demonstrating the potential of floating point compression in reducing the I/O bandwidth bottleneck on modern hardware for important classes of scientific applications.
Keywords :
data compression; exponential distribution; floating point arithmetic; graphics processing units; GPU; HPC systems; Holt exponential smoothing algorithm; accelerator; bandwidth limitation; data transfer overhead; decompression rate; fast floating point compression; floating point number; graphical processing unit; graphics processing unit; high throughput compression; high-end high performance computing; lossless compression scheme; network bandwidth; time series analysis; Lead; CellBE; Compression Ration; GPU; Holt´s Exponential Smoothening; Throughput;
Conference_Titel :
Parallel Distributed and Grid Computing (PDGC), 2012 2nd IEEE International Conference on
Conference_Location :
Solan
Print_ISBN :
978-1-4673-2922-4
DOI :
10.1109/PDGC.2012.6449838