• DocumentCode
    2875778
  • Title

    HMF: High-available Message-passing Framework for Cluster File System

  • Author

    Yang, Dong ; Chen, Zhuan ; Tang, Rongfeng ; Xiong, Jin ; Meng, Dan

  • Author_Institution
    Inst. of Comput. Technol., Grad. Univ. of Chinese Acad. of Sci., Beijing, China
  • fYear
    2009
  • fDate
    9-11 July 2009
  • Firstpage
    249
  • Lastpage
    252
  • Abstract
    In large-scale cluster systems, the failure rate of network connection is non-negligibly high. A cluster file system must have the ability to handle network failures in order to provide high-available data accesses service. Traditionally, network failure handling is only guaranteed by network protocol, or implemented within the file system semantic layer. We present the high-available message-passing framework which is called HMF. Based on the operation hierarchy in cluster file system, HMF guarantees the availability of each pair of network transmissions and their interaction with the file system sub-operations. It separates the network fault-tolerance design from the file system and keeps a simple interface between them. HMF could handle a lot of network failures internally, which greatly simplifies the implementation of file system semantic layer. Performance results show that HMF can increase the availability of message passing and reduce the cost of recovery from network failures. When there are two network channels, HMF also improves aggregate I/O bandwidth by 80% in normal condition while the performance degradation due to recovery is below 10%.
  • Keywords
    fault tolerance; file organisation; file servers; message passing; transport protocols; cluster file system; high-available message-passing; network connection; network failure handling; network fault-tolerance design; network protocol; Access protocols; Aggregates; Availability; Bandwidth; Costs; Degradation; Fault tolerant systems; File systems; Large-scale systems; Message passing; cluster file system; high availability; message passing layer;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Networking, Architecture, and Storage, 2009. NAS 2009. IEEE International Conference on
  • Conference_Location
    Hunan
  • Print_ISBN
    978-0-7695-3741-2
  • Type

    conf

  • DOI
    10.1109/NAS.2009.47
  • Filename
    5197333