DocumentCode :
3307889
Title :
Dealing with Driver Failures in the Storage Stack
Author :
Herder, Jorrit N. ; van Moolenbroek, D.C. ; Appuswamy, Rathinakumar ; Bingzheng Wu ; Gras, Ben ; Tanenbaum, Andrew S.
Author_Institution :
Dept. Comput. Sci., Vrije Univ., Amsterdam, Netherlands
fYear :
2009
fDate :
1-4 Sept. 2009
Firstpage :
119
Lastpage :
126
Abstract :
This work augments MINIX 3´s failure-resilience mechanisms with novel disk-driver recovery strategies and guaranteed file-system data integrity. We propose a flexible filter-driver framework that operates transparently to both the file system and the disk driver and enforces different protection strategies. The filter uses checksumming and mirroring in order to achieve end-to-end integrity and provide hard guarantees for detection of silent data corruption and recovery of lost data. In addition, the filter uses semantic information about the driver´s working in order to verify correct operation and proactively replace the driver if an anomaly is detected. We evaluated our design through a series of experiments on a prototype implementation: application-level benchmarks show modest performance overhead of 0-28% and software-implemented fault-injection (SWIFI) testing demonstrates the filter´s ability to detect and transparently recover from both data-integrity problems and driver-protocol violations.
Keywords :
data integrity; device drivers; disc drives; security of data; storage management; system recovery; MINIX 3; anomaly detection; checksumming; data integrity; data recovery; disk-driver recovery; driver failure; driver-protocol violation; end-to-end integrity; failure-resilience mechanism; file system; filter driver; mirroring; multiserver operating system; protection strategy; semantic information; silent data corruption; software-implemented fault-injection testing; storage stack; Computer bugs; Computer crashes; Computer science; Condition monitoring; File servers; File systems; Filters; Information filtering; Protection; Streaming media; Driver Failures; Fault Tolerance; MINIX 3; Storage Stack;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Dependable Computing, 2009. LADC '09. Fourth Latin-American Symposium on
Conference_Location :
Joao Pessoa
Print_ISBN :
978-1-4244-4678-0
Type :
conf
DOI :
10.1109/LADC.2009.12
Filename :
5234311
Link To Document :
بازگشت