Title :
Reliability measurement of mass storage system for onboard instrumentation
Author :
Choi, Minsu ; Park, Nohpill ; Piuri, Vincenzo ; Lombardi, Fabrizio
Author_Institution :
Dept. Electr. & Comput. Eng., Univ. of Missouri-Rolla, Rolla, MO, USA
Abstract :
Advances in spaceborne vehicular technology have made possible the long-life duration of the mission in harsh cosmic environments. Reliability and data integrity are the commonly emphasized requirements of spaceborne solid-state mass storage systems, because faults due to the harsh cosmic environments, such as extreme radiation, can be experienced throughout the mission. Acceptable dependability for these instruments has been achieved by using redundancy and repair. Reconfiguration (repair) of memory arrays using spare memory lines is the most common technique for reliability enhancement of memories with faults. Faulty cells in memory arrays are known to show spatial locality. This physical phenomenon is referred to as fault clustering . This paper initially investigates a quadrat-based fault model for memory arrays under clustered faults to establish a reliable foundation of measurement. Then, lifelong dependability of a fault-tolerant spaceborne memory system with hierarchical active redundancy, which consists of spare columns in each memory module and redundant memory modules, is measured in terms of the reliability (i.e., the conditional probability that the system performs correctly throughout the mission) and mean-time-to-failure (i.e., the expected time that a system will operate before it fails). Finally, minimal column redundancy search technique for the fault-tolerant memory system is proposed and verified through a series of parametric simulations. Thereby, design and fabrication of cost-effective and highly reliable fault-tolerant onboard mass storage system can be realized for dependable instrumentation.
Keywords :
aerospace instrumentation; data integrity; fault tolerant computing; redundancy; storage management chips; data integrity; fault clustering; fault-tolerant spaceborne memory system; faulty cells; harsh cosmic environments; hierarchical active redundancy; mass storage system; mean time to failure; memory array reconfiguration; onboard instrumentation; quadrat-based fault model; redundancy minimization; reliability measurement; spaceborne vehicular technology; spare memory lines; Circuit faults; Fault tolerant systems; Instruments; Lattices; Redundancy; Semiconductor device modeling; Single event transient; Single event upset; Solid state circuits; Vehicular and wireless technologies; Clustered faults; hierarchical active redundancy; mean-time-to-failure (MTTF); memory reconfiguration (repair); onboard mass storage system; quadrat-based fault model; redundancy minimization; reliability;
Journal_Title :
Instrumentation and Measurement, IEEE Transactions on
DOI :
10.1109/TIM.2005.858514