DocumentCode :
121185
Title :
A High Performance Compression Method for Climate Data
Author :
Songbin Liu ; Xiaomeng Huang ; Yufang Ni ; Haohuan Fu ; Guangwen Yang
Author_Institution :
Minist. of Educ. Key Lab. for Earth Syst. Modeling, Tsinghua Univ., Beijing, China
fYear :
2014
fDate :
26-28 Aug. 2014
Firstpage :
68
Lastpage :
77
Abstract :
Climate modeling data are usually multidimensional arrays of floating-point numbers. These arrays typically have two or three spatial dimensions and one temporal dimension, describing the evolvement of climate variables in a time span. With the advances of high performance computing, the volume of climate data is expanding exponentially, bringing tough challenges for climate data archiving and sharing. In this paper, we propose a lossless compression algorithm for the time-spatial climate floating-point arrays. Our compression algorithm can eliminate more data redundancy efficiently through adaptive prediction, XOR-differencing, and multi-way compression. In addition, static regions, which are very common in climate data, can be identified and compressed more efficiently. Moreover, to utilize the multi-cores on modern computers, we proposed a method to parallelize our compression algorithm. Evaluations demonstrate that single thread version of our compression method can achieve the best balance in compression ratios, deflating throughputs and inflating throughputs. And the parallel version can achieve 800 MB/s deflating throughputs and over 2600 MB/s inflating throughputs on a 16-core server.
Keywords :
data compression; geophysics computing; multiprocessing systems; parallel processing; 16-core server; XOR-differencing; adaptive prediction; climate data archiving; climate data sharing; climate modeling data; data redundancy elimination; deflating throughputs; high performance compression method; high performance computing; inflating throughputs; lossless compression algorithm; multicores; multidimensional floating-point number arrays; multiway compression; single thread compression method; spatial dimensions; temporal dimension; time-spatial climate floating-point arrays; Arrays; Compression algorithms; Compressors; Correlation; Data models; Meteorology; Throughput; climate data; data compression; parallel compression;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing with Applications (ISPA), 2014 IEEE International Symposium on
Conference_Location :
Milan
Type :
conf
DOI :
10.1109/ISPA.2014.18
Filename :
6924431
Link To Document :
بازگشت