DocumentCode
2875778
Title
HMF: High-available Message-passing Framework for Cluster File System
Author
Yang, Dong ; Chen, Zhuan ; Tang, Rongfeng ; Xiong, Jin ; Meng, Dan
Author_Institution
Inst. of Comput. Technol., Grad. Univ. of Chinese Acad. of Sci., Beijing, China
fYear
2009
fDate
9-11 July 2009
Firstpage
249
Lastpage
252
Abstract
In large-scale cluster systems, the failure rate of network connection is non-negligibly high. A cluster file system must have the ability to handle network failures in order to provide high-available data accesses service. Traditionally, network failure handling is only guaranteed by network protocol, or implemented within the file system semantic layer. We present the high-available message-passing framework which is called HMF. Based on the operation hierarchy in cluster file system, HMF guarantees the availability of each pair of network transmissions and their interaction with the file system sub-operations. It separates the network fault-tolerance design from the file system and keeps a simple interface between them. HMF could handle a lot of network failures internally, which greatly simplifies the implementation of file system semantic layer. Performance results show that HMF can increase the availability of message passing and reduce the cost of recovery from network failures. When there are two network channels, HMF also improves aggregate I/O bandwidth by 80% in normal condition while the performance degradation due to recovery is below 10%.
Keywords
fault tolerance; file organisation; file servers; message passing; transport protocols; cluster file system; high-available message-passing; network connection; network failure handling; network fault-tolerance design; network protocol; Access protocols; Aggregates; Availability; Bandwidth; Costs; Degradation; Fault tolerant systems; File systems; Large-scale systems; Message passing; cluster file system; high availability; message passing layer;
fLanguage
English
Publisher
ieee
Conference_Titel
Networking, Architecture, and Storage, 2009. NAS 2009. IEEE International Conference on
Conference_Location
Hunan
Print_ISBN
978-0-7695-3741-2
Type
conf
DOI
10.1109/NAS.2009.47
Filename
5197333
Link To Document