DocumentCode :
3722949
Title :
FAIL*: An Open and Versatile Fault-Injection Framework for the Assessment of Software-Implemented Hardware Fault Tolerance
Author :
Horst Schirmeier;Martin Hoffmann;Christian Dietrich;Michael Lenz;Daniel Lohmann;Olaf Spinczyk
Author_Institution :
Dept. of Comput. Sci. 12, Tech. Univ. Dortmund, Dortmund, Germany
fYear :
2015
Firstpage :
245
Lastpage :
255
Abstract :
Due to voltage and structure shrinking, the influence of radiation on a circuit´s operation increases, resulting in future hardware designs exhibiting much higher rates of soft errors. Software developers have to cope with these effects to ensure functional safety. However, software-based hardware fault tolerance is a holistic property that is tricky to achieve in practice, potentially impaired by every single design decision. We present FAIL*, an open and versatile architecture-level fault-injection (FI) framework for the continuous assessment and quantification of fault tolerance in an iterative software development process. FAIL* supplies the developer with reusable and composable FI campaigns, advanced pre-and post-processing analyses to easily identify sensitive spots in the software, well-abstracted back-end implementations for several hardware and simulator platforms, and scalability of FI campaigns by providing massive parallelization. We describe FAIL*, its application to the development process of safety-critical software, and the lessons learned from a real-world example.
Keywords :
"Hardware","Software","Circuit faults","Fault tolerance","Fault tolerant systems","Computer architecture","Software measurement"
Publisher :
ieee
Conference_Titel :
Dependable Computing Conference (EDCC), 2015 Eleventh European
Type :
conf
DOI :
10.1109/EDCC.2015.28
Filename :
7371972
Link To Document :
بازگشت