DocumentCode :
3454704
Title :
Improved read performance in a cost-effective, fault-tolerant parallel virtual file system (CEFT-PVFS)
Author :
Zhu, Yifeng ; Jiang, Hong ; Qin, Xiao ; Feng, Dan ; Swanson, David R.
Author_Institution :
Dept. of Comput. Sci. & Eng., Nebraska Univ., Lincoln, NE, USA
fYear :
2003
fDate :
12-15 May 2003
Firstpage :
730
Lastpage :
735
Abstract :
Due to the ever-widening performance gap between processors and disks, I/O operations tend to become the major performance bottleneck of data-intensive applications on modern clusters. If all the existing disks on the nodes of a cluster are connected together to establish high performance parallel storage systems, the cluster´s overall performance can be boosted at no additional cost. CEFT-PVFS (a RAID 10 style parallel file system that extends the original PVFS), as one such system, divides the cluster nodes into two groups, stripes the data across one group in a round-robin fashion, and then duplicates the same data to the other group to provide storage service of high performance and high reliability. Previous research has shown that the system reliability is improved by a factor of more than 40 with mirroring while maintaining a comparable write performance. This paper presents another benefit of CEFT-PVFS in which the aggregate peak read performance can be improved by as much as 100% over that of the original PVFS by exploiting the increased parallelism. Additionally, when the data servers, which typically are also computational nodes in a cluster environment, are loaded in an unbalanced way by applications running in the cluster, the read performance of PVFS will be degraded significantly. On the contrary, in the CEFT-PVFS, a heavily loaded data server can be skipped and all the desired data is read from its mirroring node. Thus the performance will not be affected unless both the server node and its mirroring node are heavily loaded.
Keywords :
fault tolerance; network operating systems; virtual storage; workstation clusters; cluster environment; cost-effective fault-tolerant system; data server; data-intensive applications; parallel virtual file system; read performance; storage system; system reliability; Aggregates; Application software; Computer science; Costs; Data engineering; Fault tolerant systems; File systems; Parallel processing; Reliability; Throughput;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster Computing and the Grid, 2003. Proceedings. CCGrid 2003. 3rd IEEE/ACM International Symposium on
Print_ISBN :
0-7695-1919-9
Type :
conf
DOI :
10.1109/CCGRID.2003.1199440
Filename :
1199440
Link To Document :
بازگشت