DocumentCode
618573
Title
CORE: Augmenting regenerating-coding-based recovery for single and concurrent failures in distributed storage systems
Author
Runhui Li ; Jian Lin ; Lee, Patrick P. C.
Author_Institution
Dept. of Comput. Sci. & Eng., Chinese Univ. of Hong Kong, Hong Kong, China
fYear
2013
fDate
6-10 May 2013
Firstpage
1
Lastpage
6
Abstract
Data availability is critical in distributed storage systems, especially when node failures are prevalent in real life. A key requirement is to minimize the amount of data transferred among nodes when recovering the lost or unavailable data of failed nodes. This paper explores recovery solutions based on regenerating codes, which are shown to provide fault-tolerant storage and minimum recovery bandwidth. Existing optimal regenerating codes are designed for single node failures. We build a system called CORE, which augments existing optimal regenerating codes to support a general number of failures including single and concurrent failures. We theoretically show that CORE achieves the minimum possible recovery bandwidth for most cases. We implement CORE and evaluate our prototype atop a Hadoop HDFS cluster testbed with up to 20 storage nodes. We demonstrate that our CORE prototype conforms to our theoretical findings and achieves recovery bandwidth saving when compared to the conventional recovery approach based on erasure codes.
Keywords
cloud computing; codes; storage management; CORE; Hadoop HDFS cluster; concurrent failure; distributed storage system; fault-tolerant storage; optimal regenerating codes; regenerating-coding-based recovery; Availability; Bandwidth; Encoding; Equations; Nickel; Strips; Throughput; coding theory; distributed storage systems; experiments and implementation; failure recovery; regenerating codes;
fLanguage
English
Publisher
ieee
Conference_Titel
Mass Storage Systems and Technologies (MSST), 2013 IEEE 29th Symposium on
Conference_Location
Long Beach, CA
ISSN
2160-195X
Print_ISBN
978-1-4799-0217-0
Type
conf
DOI
10.1109/MSST.2013.6558428
Filename
6558428
Link To Document