• DocumentCode
    569508
  • Title

    RRSR: A Read Request Sorting and Reorganization Strategy to Improve Data Read Performance for Deduplication Systems Based on Sparse Index and Pipeline Parallelism

  • Author

    Xing, Qianqian ; Li, Feng ; Li, Jie

  • Author_Institution
    Comput. Sci. & Technol, Shandong Univ., Jinan, China
  • fYear
    2012
  • fDate
    17-19 Aug. 2012
  • Firstpage
    1021
  • Lastpage
    1024
  • Abstract
    In order to balance the deduplication rate which is an important evaluation metric that determines the disk space saved and the performance influenced by the throughput, and to enhance the capability of the dedupliacation system, a Read Request Sorting and Reorganization Strategy is proposed for avoiding the bottleneck of random read disk detention. The algorithm addresses the data read performance which becomes a critical bottleneck. The random reads require random disk seeks which affects IO throughputs in deduplication systems significantly. The algorithm caches the read requests in the pipeline pool, then utilizes the strategy to divide the IO requests into groups in accordance with the block id which uniquely identifies the block, and merges the identical requests to avoid the promotion of the workload. Then the system sorts the requests on the basis of sequence of fingerprints in Block Index. We built our prototype system based on the sparing index and pipeline parallelism to enhance the improved efficiency of deduplication cluster, and introduce new techniques and structure to accelerate the IO performance.
  • Keywords
    disc storage; merging; performance evaluation; pipeline processing; sorting; IO performance acceleration; IO requests; IO throughputs; RRSR strategy; block identification; block index; data read performance improvement; deduplication cluster; deduplication rate balancing; deduplication systems; disk space determination; identical request merging; pipeline parallelism; random read disk detention bottleneck avoidance; read request sorting and reorganization strategy; sparse index; workload promotion avoidance; Acceleration; File systems; Indexes; Parallel processing; Pipelines; Prototypes; Sorting; IO performance; IO pipeline; Read Request Sort and Reorganization; Sparse Index; deduplication;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational and Information Sciences (ICCIS), 2012 Fourth International Conference on
  • Conference_Location
    Chongqing
  • Print_ISBN
    978-1-4673-2406-9
  • Type

    conf

  • DOI
    10.1109/ICCIS.2012.270
  • Filename
    6300788