DocumentCode :
1783684
Title :
On codes with availability for distributed storage
Author :
Rawat, A.S. ; Papailiopoulos, Dimitris S. ; Dimakis, Alexandros G. ; Vishwanath, Sriram
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Texas at Austin, Austin, TX, USA
fYear :
2014
fDate :
21-23 May 2014
Firstpage :
15
Lastpage :
18
Abstract :
Modern large-scale distributed storage systems use erasure codes only to store cold data, i.e., rarely accessed data such as click logs. However, a major portion of the data currently used for large-scale processing is hot data: data that are frequently accessed, in some cases by many users or system processes simultaneously. When storing hot data, replication seems to be the option of choice for redundancy due to a very desirable property: a single information symbol can be accessed in parallel as many times as the number of available replicas. This is sometimes referred to as higher data availability. However, the rate of a replication scheme vanishes as we increase its availability or replication factor. This paper describes erasure codes that have arbitrarily high rate while allowing for high availability. In particular, these codes enable reconstruction of each information symbol from t disjoint groups of other code symbols, each of size at most r. This paper further shows that these codes attain a trade-off between minimum distance, availability and locality.
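The availability property in the abstract (each information symbol recoverable from t disjoint groups of at most r other symbols) can be illustrated with a small sketch. This is not the construction from the paper: it assumes a textbook product-style layout in which an m x m grid of data symbols gets one XOR parity per row and one per column, giving t = 2 and r = m. The function names `encode`, `repair_groups`, and `rate` are hypothetical.

```python
from functools import reduce
from operator import xor


def encode(data):
    """Compute one XOR parity per row and per column of a square data grid."""
    row_parity = [reduce(xor, row) for row in data]
    col_parity = [reduce(xor, col) for col in zip(*data)]
    return row_parity, col_parity


def repair_groups(data, row_parity, col_parity, i, j):
    """Return two disjoint groups of other symbols; each one recovers data[i][j]."""
    m = len(data)
    # Group 1: the rest of row i plus that row's parity symbol.
    row_group = [data[i][c] for c in range(m) if c != j] + [row_parity[i]]
    # Group 2: the rest of column j plus that column's parity symbol.
    col_group = [data[r][j] for r in range(m) if r != i] + [col_parity[j]]
    return row_group, col_group


def rate(m):
    """Rate of the assumed layout: m*m data symbols out of m*m + 2*m stored."""
    return m * m / (m * m + 2 * m)


data = [[5, 9], [12, 7]]  # illustrative values only
row_parity, col_parity = encode(data)
row_group, col_group = repair_groups(data, row_parity, col_parity, 0, 0)

# XOR-ing either group reproduces the target symbol, so two readers can
# fetch data[0][0] in parallel without touching the symbol itself.
recovered_from_row = reduce(xor, row_group)
recovered_from_col = reduce(xor, col_group)
```

Under these assumptions the rate is m / (m + 2), which approaches 1 as m grows while t stays at 2. A replication scheme offering the same two alternative reads would need three copies of every symbol, i.e., rate 1/3.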
Keywords :
distributed processing; storage management; click logs; code symbols; cold data; distributed storage systems; erasure codes; higher data availability; hot data; replication factor; replication scheme; single information symbol; Availability; Linear codes; Maintenance engineering; Systematics; Upper bound;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Communications, Control and Signal Processing (ISCCSP), 2014 6th International Symposium on
Conference_Location :
Athens
Type :
conf
DOI :
10.1109/ISCCSP.2014.6877805
Filename :
6877805