• DocumentCode
    11933
  • Title

    An Improved Sub-Packetization Bound for Minimum Storage Regenerating Codes

  • Author

    Goparaju, Sreechakra ; Tamo, Itzhak ; Calderbank, Robert

  • Author_Institution
    Dept. of Electr. Eng., Princeton Univ., Princeton, NJ, USA
  • Volume
    60
  • Issue
    5
  • fYear
    2014
  • fDate
    May-14
  • Firstpage
    2770
  • Lastpage
    2779
  • Abstract
    Distributed storage systems employ codes to provide resilience to failure of multiple storage disks. In particular, an (n, k) maximum distance separable (MDS) code stores k symbols in n disks such that the overall system is tolerant to a failure of up to n - k disks. However, access to at least k disks is still required to repair a single erasure. To reduce repair bandwidth, array codes are used where the stored symbols or packets are vectors of length ℓ. The MDS array codes have the potential to repair a single erasure using a fraction 1/(n - k) of data stored in the remaining disks. We introduce new methods of analysis, which capitalize on the translation of the storage system problem into a geometric problem on a set of operators and subspaces. In particular, we ask the following question: for a given (n, k), what is the minimum vector-length or subpacketization factor ℓ required to achieve this optimal fraction? For exact recovery of systematic disks in an MDS code of low redundancy, i.e., k/n > 1/2, the best known explicit codes have a subpacketization factor ℓ, which is exponential in k. It has been conjectured that for a fixed number of parity nodes, it is in fact necessary for ℓ to be exponential in k. In this paper, we provide a new log-squared converse bound on k for a given ℓ, and prove that k ≤ 2 log2 I(logδ ℓ + 1), for an arbitrary number of parity nodes r = n - k, where δ = r/(r - 1).
  • Keywords
    disc drives; error correction codes; MDS code; array codes; distributed storage systems; geometric problem; k disks; maximum distance separable code; minimum storage regenerating codes; multiple storage disks failure; parity nodes; subpacketization bound; subpacketization factor; vector-length; Bandwidth; Encoding; Maintenance engineering; Silicon; Systematics; Upper bound; Vectors; Distributed storage; error correction codes; interference alignment; sub-packetization;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/TIT.2014.2309000
  • Filename
    6750093