DocumentCode :
659432
Title :
CORE: Cross-object redundancy for efficient data repair in storage systems
Author :
Esmaili, Kyumars Sheykh ; Pamies-Juarez, Lluis ; Datta, Amitava
Author_Institution :
Nanyang Technol. Univ., Singapore, Singapore
fYear :
2013
fDate :
6-9 Oct. 2013
Firstpage :
246
Lastpage :
254
Abstract :
Erasure codes are an integral part of many distributed storage systems aimed at Big Data, since they provide high fault-tolerance for low overheads. However, traditional erasure codes are inefficient on replenishing lost data (vital for long term resilience) and on reading stored data in degraded environments (when nodes might be unavailable). Consequently, novel codes optimized to cope with distributed storage system nuances are vigorously being researched. In this paper, we take an engineering alternative, exploring the use of simple and mature techniques - juxtaposing a standard erasure code with RAID-4 like parity to realize cross object redundancy (CORE), and integrate it with HDFS. We benchmark the implementation in a proprietary cluster and in EC2. Our experiments show that for an extra 20% storage overhead (compared to traditional erasure codes) CORE yields up to 58% saving in bandwidth and is up to 76% faster while recovering a single failed node. The gains are respectively 16% and 64% for double node failures.
Keywords :
data structures; distributed databases; fault tolerant computing; CORE; EC2; HDFS; RAID-4 like parity; big data; cross-object redundancy; degraded environments; double node failures; efficient data repair; erasure codes; fault-tolerance; single failed node; storage systems; stored data; Encoding; Fault tolerant systems; Generators; Maintenance engineering; Product codes; Redundancy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data, 2013 IEEE International Conference on
Conference_Location :
Silicon Valley, CA
Type :
conf
DOI :
10.1109/BigData.2013.6691581
Filename :
6691581
Link To Document :
بازگشت