DocumentCode
358593
Title
Detailed radiation fault modeling of the Remote Exploration and Experimentation (REE) first generation testbed architecture
Author
Beahan, John ; Edmonds, Larry ; Ferraro, Robert D. ; Johnston, Allan ; Katz, Daniel S. ; Some, Raphael R.
Author_Institution
Jet Propulsion Lab., California Inst. of Technol., Pasadena, CA, USA
Volume
5
fYear
2000
fDate
2000
Firstpage
279
Abstract
The goal of the NASA HPCC Remote Exploration and Experimentation (REE) Project is to transfer commercial supercomputing technology into space. The project will use state-of-the-art, low-power, non-radiation-hardened COTS hardware chips and COTS software to the maximum extent possible, and will rely on software-implemented fault tolerance to provide the required levels of availability and reliability. We outline the methodology used to develop a detailed radiation fault model for the REE Testbed architecture. The model addresses the effects of energetic protons and heavy ions, which cause single event upset and single event multiple upset events in digital logic devices and which are expected to be the primary fault generation mechanism. Unlike previous modeling efforts, this model will address fault rates and types in computer subsystems at a sufficiently fine level of granularity (i.e., the register level) that specific software and operational errors can be derived. We present the current state of the model, model verification activities and results to date, and plans for the future. Finally, we explain the methodology by which this model will be used to derive application-level error effects sets. These error effects sets will be used in conjunction with our Testbed fault injection capabilities and our applications' mission scenarios to replicate the predicted fault environment on our suite of onboard applications.
Keywords
aerospace computing; fault simulation; radiation hardening (electronics); software architecture; software fault tolerance; software tools; space vehicle electronics; COTS hardware chips; COTS software; Remote Exploration and Experimentation Project; SEU events; application-level error effects; commercial supercomputing technology; computer subsystems; digital logic devices; energetic proton effects; fault generation mechanism; fault injection capabilities; first generation testbed architecture; heavy ion effects; model verification; onboard applications; radiation fault modeling; register level; single event multiple upset events; software-implemented fault tolerance; Availability; Computer architecture; Fault tolerance; Hardware; Logic devices; NASA; Protons; Single event upset; Space technology; Testing;
fLanguage
English
Publisher
IEEE
Conference_Titel
Aerospace Conference Proceedings, 2000 IEEE
Conference_Location
Big Sky, MT
ISSN
1095-323X
Print_ISBN
0-7803-5846-5
Type
conf
DOI
10.1109/AERO.2000.878499
Filename
878499