Title :
Reliability and availability analysis for the JPL Remote Exploration and Experimentation System
Author :
Chen, Dong ; Dharmaraja, S. ; Chen, Dongyan ; Li, Lei ; Trivedi, Kishor S. ; Some, Raphael R. ; Nikora, Allen P.
Author_Institution :
Dept. of Electr. & Comput. Eng., Duke Univ., Durham, NC, USA
Abstract :
The NASA Remote Exploration and Experimentation (REE) Project, managed by the Jet Propulsion Laboratory, has the vision of bringing commercial supercomputing technology into space, in a form which meets the demanding environmental requirements, to enable a new class of science investigation and discovery. Dependability goals of the REE system are 99% reliability over 5 years and 99% availability. In this paper we focus on the reliability/availability modeling and analysis of the REE system. We carry out this task using fault trees, reliability block diagrams, stochastic reward nets and hierarchical models. Our analysis helps to determine the ranges of parameters for which the REE dependability goal will be met. The analysis also allows us to assess different hardware and software fault-tolerance techniques.
Keywords :
aerospace computing; fault tolerant computing; fault trees; parallel architectures; parallel programming; stochastic processes; system recovery; Jet Propulsion Laboratory; NASA Remote Exploration and Experimentation Project; availability analysis; dependability; fault trees; hardware fault tolerance techniques; hierarchical models; modeling; reliability analysis; reliability block diagrams; software fault tolerance techniques; stochastic reward nets; supercomputing technology; Availability; Environmental management; Fault trees; Hardware; NASA; Project management; Propulsion; Space technology; Stochastic processes; Technology management;
Conference_Titel :
Dependable Systems and Networks, 2002. DSN 2002. Proceedings. International Conference on
Print_ISBN :
0-7695-1101-5
DOI :
10.1109/DSN.2002.1028918