DocumentCode :
1915419
Title :
Reducing the De-linearization of Data Placement to Improve Deduplication Performance
Author :
Yujuan Tan ; Zhichao Yan ; Dan Feng ; Sha, Edwin H-M ; Xiongzi Ge
Author_Institution :
Sch. of Comput. Sci. & Technol., Chongqing Univ., Chongqing, China
fYear :
2012
fDate :
10-16 Nov. 2012
Firstpage :
796
Lastpage :
800
Abstract :
Data deduplication is a lossless compression technology that replaces the redundant data chunks with pointers pointing to the already-stored ones. Due to this intrinsic data elimination feature, the deduplication commodity would delinearize the data placement and force the data chunks that belong to the same data object to be divided into multiple separate parts. In our preliminary study, it is found that the de-linearization of the data placement would weaken the data spatial locality that is used for improving data read performance, deduplication throughput and efficiency in some deduplication approaches, which significantly affects the deduplication performance. In this paper, we first analyze the negative effect of the de-linearization of data placement to the data deduplication performance with some examples and experimental evidences, and then propose an effective approach to reduce the de-linearization of data placement by sacrificing little compression ratios. The experimental evaluation driven by the real world datasets shows that our proposed approach effectively reduces the de-linearization of the data placement and enhances the data spatial locality, which significantly improves the deduplication performances including deduplication throughput, deduplication efficiency and data read performance, while at the cost of little compression ratios.
Keywords :
data compression; compression ratio; data chunk; data deduplication; data elimination feature; data object; data placement; data read performance; data spatial locality; deduplication approach; deduplication efficiency; deduplication throughput; lossless compression technology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:
Conference_Location :
Salt Lake City, UT
Print_ISBN :
978-1-4673-6218-4
Type :
conf
DOI :
10.1109/SC.Companion.2012.110
Filename :
6495892
Link To Document :
بازگشت