• DocumentCode
    3738066
  • Title

    A self-aware data compression system on FPGA in Hadoop

  • Author

    Yubin Li;Yuliang Sun;Guohao Dai;Yuzhi Wang;Jiacai Ni;Yu Wang;Guoliang Li;Huazhong Yang

  • Author_Institution
    Dept. E.E., Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing, China
  • fYear
    2015
  • Firstpage
    196
  • Lastpage
    199
  • Abstract
    With the exponential growth of data size, data storage and analysis have been exposed to more challenges due to the lack of disk capacity and the limited network bandwidth. Data compression technique provides a good solution to mitigate these effects. In this paper, we propose a self-aware data compression system on FPGA for typical data warehousing, such as Hive, with column stored data and multi-threading requirements. The hardware accelerators can change the degree and hierarchy of parallelism depending on the data to be compressed (during the runtime). We test the system performance on a Xilinx VC707 FPGA board and the experimental results show that, up to 16 3-parallelism accelerators can be implemented and the throughput could be improved up to 432 MB/s. It is 6.25X speedup compared with the software solution under the same number of threads.
  • Keywords
    "Hardware","Engines","Field programmable gate arrays","Computer architecture","Data compression","Parallel processing","Throughput"
  • Publisher
    ieee
  • Conference_Titel
    Field Programmable Technology (FPT), 2015 International Conference on
  • Type

    conf

  • DOI
    10.1109/FPT.2015.7393149
  • Filename
    7393149