Title :
RRSR: A Read Request Sorting and Reorganization Strategy to Improve Data Read Performance for Deduplication Systems Based on Sparse Index and Pipeline Parallelism
Author :
Xing, Qianqian ; Li, Feng ; Li, Jie
Author_Institution :
Comput. Sci. & Technol, Shandong Univ., Jinan, China
Abstract :
In order to balance the deduplication rate which is an important evaluation metric that determines the disk space saved and the performance influenced by the throughput, and to enhance the capability of the dedupliacation system, a Read Request Sorting and Reorganization Strategy is proposed for avoiding the bottleneck of random read disk detention. The algorithm addresses the data read performance which becomes a critical bottleneck. The random reads require random disk seeks which affects IO throughputs in deduplication systems significantly. The algorithm caches the read requests in the pipeline pool, then utilizes the strategy to divide the IO requests into groups in accordance with the block id which uniquely identifies the block, and merges the identical requests to avoid the promotion of the workload. Then the system sorts the requests on the basis of sequence of fingerprints in Block Index. We built our prototype system based on the sparing index and pipeline parallelism to enhance the improved efficiency of deduplication cluster, and introduce new techniques and structure to accelerate the IO performance.
Keywords :
disc storage; merging; performance evaluation; pipeline processing; sorting; IO performance acceleration; IO requests; IO throughputs; RRSR strategy; block identification; block index; data read performance improvement; deduplication cluster; deduplication rate balancing; deduplication systems; disk space determination; identical request merging; pipeline parallelism; random read disk detention bottleneck avoidance; read request sorting and reorganization strategy; sparse index; workload promotion avoidance; Acceleration; File systems; Indexes; Parallel processing; Pipelines; Prototypes; Sorting; IO performance; IO pipeline; Read Request Sort and Reorganization; Sparse Index; deduplication;
Conference_Titel :
Computational and Information Sciences (ICCIS), 2012 Fourth International Conference on
Conference_Location :
Chongqing
Print_ISBN :
978-1-4673-2406-9
DOI :
10.1109/ICCIS.2012.270