• DocumentCode
    43719
  • Title

    Efficient Exact Regenerating Codes for Byzantine Fault Tolerance in Distributed Networked Storage

  • Author

    Han, Yunghsiang S. ; Hung-Ta Pai ; Rong Zheng ; Wai Ho Mow

  • Author_Institution
    Dept. of Electr. Eng., Nat. Taiwan Univ. of Sci. & Technol., Taipei, Taiwan
  • Volume
    62
  • Issue
    2
  • fYear
    2014
  • fDate
    Feb-14
  • Firstpage
    385
  • Lastpage
    397
  • Abstract
    Today´s large-scale distributed storage systems are commonly built using commodity software and hardware. As a result, crash-stop and Byzantine failures in such systems become more and more prevalent. In the literature, regenerating codes have been shown to be a more efficient way to disperse information across multiple storage nodes and recover from crash-stop failures. In this paper, we propose a novel decoding design of product-matrix constructed regenerating codes in conjunction with integrity check that allows exact regeneration of failed nodes and data reconstruction in the presence of Byzantine failures. A progressive decoding mechanism is incorporated in both procedures to leverage computation performed thus far. Unlike previous works, our new regenerating code decoding has the advantage that its building blocks, such as Reed-Solomon codes and standard cryptographic hash functions, are relatively well-understood because of their widespread applications. The fault tolerance and security properties of the proposed schemes are also analyzed. In addition, the performance of the proposed schemes, in terms of the average number of access nodes and the reconstruction failure probability versus the node failure probability, are also evaluated by Monte Carlo simulations.
  • Keywords
    Reed-Solomon codes; computer network security; cryptography; data handling; digital storage; fault tolerant computing; Byzantine failures; Byzantine fault tolerance; Monte Carlo simulation; Reed-Solomon codes; data reconstruction; decoding design; distributed networked storage; exact regenerating code; failed node reconstruction; integrity check; large scale distributed storage systems; node failure probability; product-matrix construction; progressive decoding; reconstruction failure probability; standard cryptographic hash functions; Bandwidth; Cryptography; Decoding; Encoding; Maintenance engineering; Polynomials; Symmetric matrices; Byzantine failures; Network storage; Reed-Solomon code; error-detection code; regenerating code;
  • fLanguage
    English
  • Journal_Title
    Communications, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0090-6778
  • Type

    jour

  • DOI
    10.1109/TCOMM.2013.122313.130492
  • Filename
    6698285