Title :
Fault tolerant PVFS2 based on data replication
Author :
Nieto, Erik ; Camacho, Hugo E. ; Anguita, Mancia ; Díaz, Antonio F. ; Ortega, Julio
Author_Institution :
Dept. of Comput. Archit. & Technol., Univ. of Granada, Granada, Spain
Abstract :
Aggregating the capacity and bandwidth of the commodity disks in the nodes of a cluster provides cost effective and high performance storage systems. Nevertheless, this strategy could be a feasible approach only if the mean time to failure of disks and nodes is faced. The number of failures increases with the nodes and it is especially important in parallel file systems, like PVFS, because having a file striped over server disks increases the probability of failures. This work proposes a strategy to include data replication in the second version of PVFS in order to provide fault tolerance. We also analyze the performance of the implementation of this approach.
Keywords :
fault tolerant computing; replicated databases; data replication; fault tolerant PVFS2; parallel file systems; Bandwidth; Fault tolerance; Fault tolerant systems; File systems; Grid computing; Servers; Storage area networks;
Conference_Titel :
Parallel Distributed and Grid Computing (PDGC), 2010 1st International Conference on
Conference_Location :
Solan
Print_ISBN :
978-1-4244-7675-6
DOI :
10.1109/PDGC.2010.5679880