Title :
Rethinking RAID-5 Data Layout for Better Scalability
Author :
Guangyan Zhang ; Weimin Zheng ; Keqin Li
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Abstract :
In RAID-5, data and parity blocks are distributed across all disks in a round-robin fashion. Previous approaches to RAID-5 scaling preserve such round-robin distribution, therefore requiring all the data to be migrated. In this paper, we rethink RAID-5 data layout and propose a new approach to RAID-5 scaling called MiPiL. First, MiPiL minimizes data migration while maintaining a uniform data distribution, not only for regular data but also for parity data. It moves the minimum number of data blocks from old disks to new disks for regaining a uniform data distribution. Second, MiPiL optimizes online data migration with piggyback parity updates and lazy metadata updates. Piggyback parity updates during data migration reduce the numbers of additional XOR computations and disk I/Os. Lazy metadata updates minimize the number of metadata writes without compromising data reliability. We implement MiPiL in Linux Kernel 2.6.32.9, and evaluate its performance by replaying three real-system traces. The results demonstrate that MiPiL consistently outperforms the existing “moving-everything” approach by 74.07-77.57% in redistribution time and by 25.78-70.50% in user response time. The experiments also illustrate that under the WebSearch2 and Financial1 workloads, the performance of the RAID-5 scaled using MiPiL is almost identical to that of the round-robin RAID-5.
Keywords :
Linux; RAID; file organisation; meta data; operating system kernels; reliability; Financial1 workloads; Linux Kernel 2.6.32.9; MiPiL; RAID-5 data layout; RAID-5 scaling; WebSearch2 workload; data reliability; minimizing data migration-piggyback parity updates-and-lazy metadata updates; moving-everything approach; online data migration; parity data; regular data; round-robin RAID-5; uniform data distribution; Data models; Disk arrays; Metadata; Parity check codes; Data migration; RAID-5 scaling; disk array; metadata update; parity update;
Journal_Title :
Computers, IEEE Transactions on
DOI :
10.1109/TC.2013.143