• DocumentCode
    19346
  • Title

    Rethinking RAID-5 Data Layout for Better Scalability

  • Author

    Guangyan Zhang ; Weimin Zheng ; Keqin Li

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
  • Volume
    63
  • Issue
    11
  • fYear
    2014
  • fDate
    Nov. 2014
  • Firstpage
    2816
  • Lastpage
    2828
  • Abstract
    In RAID-5, data and parity blocks are distributed across all disks in a round-robin fashion. Previous approaches to RAID-5 scaling preserve such round-robin distribution, therefore requiring all the data to be migrated. In this paper, we rethink RAID-5 data layout and propose a new approach to RAID-5 scaling called MiPiL. First, MiPiL minimizes data migration while maintaining a uniform data distribution, not only for regular data but also for parity data. It moves the minimum number of data blocks from old disks to new disks for regaining a uniform data distribution. Second, MiPiL optimizes online data migration with piggyback parity updates and lazy metadata updates. Piggyback parity updates during data migration reduce the numbers of additional XOR computations and disk I/Os. Lazy metadata updates minimize the number of metadata writes without compromising data reliability. We implement MiPiL in Linux Kernel 2.6.32.9, and evaluate its performance by replaying three real-system traces. The results demonstrate that MiPiL consistently outperforms the existing “moving-everything” approach by 74.07-77.57% in redistribution time and by 25.78-70.50% in user response time. The experiments also illustrate that under the WebSearch2 and Financial1 workloads, the performance of the RAID-5 scaled using MiPiL is almost identical to that of the round-robin RAID-5.
  • Keywords
    Linux; RAID; file organisation; meta data; operating system kernels; reliability; Financial1 workloads; Linux Kernel 2.6.32.9; MiPiL; RAID-5 data layout; RAID-5 scaling; WebSearch2 workload; data reliability; minimizing data migration-piggyback parity updates-and-lazy metadata updates; moving-everything approach; online data migration; parity data; regular data; round-robin RAID-5; uniform data distribution; Data models; Disk arrays; Metadata; Parity check codes; Data migration; RAID-5 scaling; disk array; metadata update; parity update;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2013.143
  • Filename
    6552201