• DocumentCode
    119501
  • Title

    LaRS: A Load-Aware Recovery Scheme for Heterogeneous Erasure-Coded Storage Clusters

  • Author

    Haibing Luo ; Jianzhong Huang ; Qiang Cao ; Changsheng Xie

  • Author_Institution
    Wuhan Nat. Lab. for Optoelectron., Huazhong Univ. of Sci. & Technol., Wuhan, China
  • fYear
    2014
  • fDate
    6-8 Aug. 2014
  • Firstpage
    168
  • Lastpage
    175
  • Abstract
    To reduce the probability of data unavailability, it is extremely important to quickly recover failed data in a (k+r, k) erasure-coded storage cluster. In practice, storage nodes in a large-scale storage system have various network bandwidths and I/O capabilities, therefore, the heterogeneity of storage systems increases along with the growing scale. Both traditional recovery scheme and Fastest recovery scheme simply retrieve k surviving blocks from k surviving nodes, thereby resulting in low recovery performance in a heterogeneous storage cluster. In this paper, we propose a Load-aware Recovery Scheme (Lars) for heterogeneous RS-coded storage clusters. Lars not only takes into account both the heterogeneity and load of nodes, but also enables all surviving nodes to service reconstruction reads. The amount of surviving blocks retrieved by a surviving node depends on its load weight which is determined by both network bandwidth and I/O capacity. More blocks are fetched from faster nodes, and vice versa. The three recovery schemes are implemented on a 9-node heterogeneous RS-coded storage cluster, where a set of comparative experiments are conducted. The experimental results show that our Lars scheme outperforms the other two schemes by a factor of up to 1.58.
  • Keywords
    resource allocation; storage management; system recovery; IO capabilities; IO capacity; LaRS; data unavailability; heterogeneous RS-coded storage clusters; heterogeneous erasure-coded storage clusters; large-scale storage system; load weight; load-aware recovery scheme; network bandwidth; network bandwidths; surviving blocks; surviving nodes; Arrays; Bandwidth; Encoding; Redundancy; Servers; Time factors; Tin; erasure codes; heterogeneous storage; recovery scheme;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Networking, Architecture, and Storage (NAS), 2014 9th IEEE International Conference on
  • Conference_Location
    Tianjin
  • Type

    conf

  • DOI
    10.1109/NAS.2014.34
  • Filename
    6923177