DocumentCode :
3657088
Title :
Enabling Efficient and Reliable Transition from Replication to Erasure Coding for Clustered File Systems
Author :
Runhui Li;Yuchong Hu;Patrick P.C. Lee
Author_Institution :
Dept. of Comput. Sci. &
fYear :
2015
fDate :
6/1/2015 12:00:00 AM
Firstpage :
148
Lastpage :
159
Abstract :
To balance performance and storage efficiency, modern clustered file systems (CFSes) often first store data with random replication (i.e., distributing replicas across randomly selected nodes), followed by encoding the replicated data with erasure coding. We argue that random replication, while being commonly used, does not take into account erasure coding and hence will raise both performance and availability issues to the subsequent encoding operation. We propose encoding-aware replication, which carefully places the replicas so as to (i) avoid cross-rack downloads of data blocks during encoding, (ii) preserve availability without data relocation after encoding, and (iii) maintain load balancing as in random replication. We implement encoding-aware replication on HDFS, and show via tested experiments that it achieves significant encoding throughput gains over random replication. We also show via discrete-event simulations that encoding-aware replication remains effective under various parameter choices in a large-scale setting. We further show that encoding-aware replication evenly distributes replicas as in random replication.
Keywords :
"Encoding","Ear","Fault tolerance","Fault tolerant systems","Bipartite graph","Load management","Throughput"
Publisher :
ieee
Conference_Titel :
Dependable Systems and Networks (DSN), 2015 45th Annual IEEE/IFIP International Conference on
Type :
conf
DOI :
10.1109/DSN.2015.24
Filename :
7266846
Link To Document :
بازگشت