DocumentCode
1934875
Title
A scheme of structured data compression and query on Hadoop platform
Author
Xiangwu Ding ; Bo Tian ; Yefeng Li
Author_Institution
Dept. of Comput. Sci. & Technol., Univ. of Donghua, Shanghai, China
fYear
2015
fDate
3-5 Feb. 2015
Firstpage
160
Lastpage
164
Abstract
We propose a data compression and query scheme to improve the performance of processing structured data on the Hadoop platform. First, we design a data page structure for row-column hybrid storage based on HDFS. Then we propose and implement an adaptive lightweight data compression strategy based on MapReduce, which compresses data and stores it in the proposed storage structure. Finally, we provide a query strategy that executes directly on the compressed data in that storage structure. Experiments conducted on large-scale datasets demonstrate the effectiveness of the proposed strategy in reducing storage requirements and improving query performance for structured data.
Keywords
data handling; data structures; HDFS; Hadoop platform; MapReduce; adaptive lightweight data compression strategy; data page structure; query performance; query strategy; query technology; row-column hybrid storage; storage structure; structured data compression; Big data; Compression algorithms; Data analysis; Data compression; Dictionaries; Encoding; Query processing;
fLanguage
English
Publisher
IEEE
Conference_Titel
Digital Information, Networking, and Wireless Communications (DINWC), 2015 Third International Conference on
Conference_Location
Moscow
Print_ISBN
978-1-4799-6375-1
Type
conf
DOI
10.1109/DINWC.2015.7054235
Filename
7054235
Link To Document