مرکز منطقه ای اطلاع رساني علوم و فناوري - Improved read performance in a cost-effective, fault-tolerant parallel virtual file system (CEFT-PVFS)

DocumentCode :

3454704

Title :

Improved read performance in a cost-effective, fault-tolerant parallel virtual file system (CEFT-PVFS)

Author :

Zhu, Yifeng ; Jiang, Hong ; Qin, Xiao ; Feng, Dan ; Swanson, David R.

Author_Institution :

Dept. of Comput. Sci. & Eng., Nebraska Univ., Lincoln, NE, USA

fYear :

2003

fDate :

12-15 May 2003

Firstpage :

730

Lastpage :

735

Abstract :

Due to the ever-widening performance gap between processors and disks, I/O operations tend to become the major performance bottleneck of data-intensive applications on modern clusters. If all the existing disks on the nodes of a cluster are connected together to establish high performance parallel storage systems, the cluster´s overall performance can be boosted at no additional cost. CEFT-PVFS (a RAID 10 style parallel file system that extends the original PVFS), as one such system, divides the cluster nodes into two groups, stripes the data across one group in a round-robin fashion, and then duplicates the same data to the other group to provide storage service of high performance and high reliability. Previous research has shown that the system reliability is improved by a factor of more than 40 with mirroring while maintaining a comparable write performance. This paper presents another benefit of CEFT-PVFS in which the aggregate peak read performance can be improved by as much as 100% over that of the original PVFS by exploiting the increased parallelism. Additionally, when the data servers, which typically are also computational nodes in a cluster environment, are loaded in an unbalanced way by applications running in the cluster, the read performance of PVFS will be degraded significantly. On the contrary, in the CEFT-PVFS, a heavily loaded data server can be skipped and all the desired data is read from its mirroring node. Thus the performance will not be affected unless both the server node and its mirroring node are heavily loaded.

Keywords :

fault tolerance; network operating systems; virtual storage; workstation clusters; cluster environment; cost-effective fault-tolerant system; data server; data-intensive applications; parallel virtual file system; read performance; storage system; system reliability; Aggregates; Application software; Computer science; Costs; Data engineering; Fault tolerant systems; File systems; Parallel processing; Reliability; Throughput;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Cluster Computing and the Grid, 2003. Proceedings. CCGrid 2003. 3rd IEEE/ACM International Symposium on

Print_ISBN :

0-7695-1919-9

Type :

conf

DOI :

10.1109/CCGRID.2003.1199440

Filename :

1199440

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3454704