Title :
Lone Star Stack: Architecture of a Disk-Based Archival System
Author :
Grawinkel, Matthias ; Best, Gregor ; Splietker, Malte ; Brinkmann, Andre
Author_Institution :
Johannes-Gutenberg Univ. Mainz, Mainz, Germany
Abstract :
The need for huge storage systems rises with the ever growing creation of data. With growing capacities and shrinking prices, "write once read sometimes" workloads become more common. New data is constantly added, rarely updated or deleted, and every stored byte might be read at any time - a common pattern for digital archives or big data scenarios. We present the LoneStar Stack, a disk based archival storage system building block that is optimized for high reliability and energy efficiency. It provides a POSIX file system interface that uses flash based storage for write-offloading and metadata and the disk-based LoneStar RAID for user data storage. The RAID attempts to spin down disks as soon and as long as possible. For reads, only a single disk is accessed, while writes require 3 additional parity disks to be spun up. The cache aggregates new files and a semantic data placement engine decides how they are persisted to the RAID. Asynchronous data movers then persist the data. The system provides an end-to-end data integrity, an elastic fault tolerance that can at least recover from all 3-disk failures, and provides multiple paths for data integrity checking and recovery. The system can use 70% of the raw disk capacity and is optimized for fast reads with a minimum number of powered on disk drives.
Keywords :
RAID; Unix; cache storage; data integrity; disc drives; fault tolerance; flash memories; information retrieval systems; memory architecture; meta data; 3-disk failures; LoneStar Stack; POSIX file system interface; asynchronous data movers; cache; data integrity checking; data integrity recovery; disk based archival storage system; disk capacity; disk drives; disk-based LoneStar RAID; disk-based archival system architecture; elastic fault tolerance; end-to-end data integrity; energy efficiency; flash based storage; huge storage systems; metadata; parity disks; reliability; semantic data placement engine; single disk; user data storage; write once read sometimes workloads; write-offloading; Ash; Databases; Disk drives; Nonvolatile memory; Random access memory; Reliability; Spinning; Archival Storage; Disk; Energy-efficient; MAID;
Conference_Titel :
Networking, Architecture, and Storage (NAS), 2014 9th IEEE International Conference on
Conference_Location :
Tianjin
DOI :
10.1109/NAS.2014.35