DocumentCode :
231309
Title :
Region based fault-tolerant distributed file storage system design under budget constraint
Author :
Mazumder, Anisha ; Das, Arun ; Chenyang Zhou ; Sen, Arunabha
Author_Institution :
Sch. of Comput., Arizona State Univ., Tempe, AZ, USA
fYear :
2014
fDate :
17-19 Nov. 2014
Firstpage :
61
Lastpage :
68
Abstract :
Two independent lines of research, (i) erasure code based file storage system design, and (ii) fault-tolerant network design for spatially correlated (or region-based) failures, have received considerable attention in the networking research community in recent times. A recently proposed (N,K)-coding based distributed file storage scheme ensures complete reconstruction of a file after network fragmentation due to any single region-based fault. For every region of the network, it stores K distinct file segments in one of the largest connected components that result from the fragmentation of the network due to the failure of that region. This distribution scheme provides an all-region fault-tolerant storage system, in the sense that no matter which region of the network fails, a largest connected component of the fragmented network will still have enough distinct file segments with which to reconstruct the file. However, the storage requirement and the associated cost for such an all-region fault-tolerant storage system may be quite high, so with a limited budget it may not be possible to realize one. We consider a budget-constrained distributed file system design problem and provide solutions that maximize the number of regions that can be made fault-tolerant within the specified budget. We show that the problem is NP-complete and provide an approximation algorithm for it. The performance of the approximation algorithm is evaluated through simulation on two real networks. The simulation results demonstrate that the worst-case experimental performance is significantly better than the worst-case theoretical bound. Moreover, the approximation algorithm almost always produces a near-optimal solution in a fraction of the time needed to find the optimal solution.
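The fault-tolerance condition stated in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it only checks, for a given segment placement, which regions are fault-tolerant in the abstract's sense (after the region fails, some largest connected component still holds at least K distinct segments). The budget constraint and the approximation algorithm are not modeled. The names `components`, `tolerant_regions`, `adj`, `regions`, and `placement`, and the representation of a region as a set of nodes, are assumptions made for illustration.

```python
from collections import deque

def components(adj, removed):
    """Connected components of the graph after deleting the nodes in `removed`."""
    seen = set(removed)
    comps = []
    for start in adj:
        if start in seen:
            continue
        seen.add(start)
        comp = {start}
        queue = deque([start])
        while queue:  # breadth-first search over surviving nodes
            u = queue.popleft()
            for v in adj[u]:
                if v not in seen:
                    seen.add(v)
                    comp.add(v)
                    queue.append(v)
        comps.append(comp)
    return comps

def tolerant_regions(adj, regions, placement, k):
    """Regions whose failure leaves a largest component with >= k distinct segments."""
    ok = []
    for name, failed in regions.items():
        comps = components(adj, failed)
        if not comps:
            continue
        largest = max(len(c) for c in comps)
        for comp in comps:
            if len(comp) != largest:
                continue  # only largest components count
            segments = set()
            for node in comp:
                segments |= placement.get(node, set())
            if len(segments) >= k:
                ok.append(name)
                break
    return ok

# Hypothetical 5-node path network a-b-c-d-e with three single-node regions.
adj = {"a": ["b"], "b": ["a", "c"], "c": ["b", "d"], "d": ["c", "e"], "e": ["d"]}
regions = {"R1": {"a"}, "R2": {"c"}, "R3": {"e"}}
placement = {"b": {0, 1}, "d": {2}}  # node -> segment ids stored there
print(tolerant_regions(adj, regions, placement, 3))  # ['R1', 'R3']
```

In this toy instance, the failure of R2 splits the network into two components of equal size, neither of which holds all three segments, so R2 is not fault-tolerant under this placement.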
Keywords :
computational complexity; fault tolerant computing; network operating systems; storage management; system recovery; (N,K)-coding based distributed file storage scheme; NP-complete problem; approximation algorithm; budget constraint; erasure code based file storage system design; fault-tolerant network design; fault-tolerant regions; file reconstruction; file segments; network failure; network fragmentation; networking research community; region based fault-tolerant distributed file storage system design; region-based failure; spatially correlated failure; storage requirement; worst case experimental performance; Algorithm design and analysis; Approximation algorithms; Approximation methods; Encoding; Fault tolerance; Fault tolerant systems; Vectors; (N,K) coding; approximation algorithm; budget; distributed data storage; region-based faults;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Reliable Networks Design and Modeling (RNDM), 2014 6th International Workshop on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4799-7039-1
Type :
conf
DOI :
10.1109/RNDM.2014.7014932
Filename :
7014932