• DocumentCode
    1241995
  • Title

    Reliability measurement of mass storage system for onboard instrumentation

  • Author

    Choi, Minsu ; Park, Nohpill ; Piuri, Vincenzo ; Lombardi, Fabrizio

  • Author_Institution
    Dept. Electr. & Comput. Eng., Univ. of Missouri-Rolla, Rolla, MO, USA
  • Volume
    54
  • Issue
    6
  • fYear
    2005
  • Firstpage
    2297
  • Lastpage
    2304
  • Abstract
    Advances in spaceborne vehicular technology have made possible the long-life duration of the mission in harsh cosmic environments. Reliability and data integrity are the commonly emphasized requirements of spaceborne solid-state mass storage systems, because faults due to the harsh cosmic environments, such as extreme radiation, can be experienced throughout the mission. Acceptable dependability for these instruments has been achieved by using redundancy and repair. Reconfiguration (repair) of memory arrays using spare memory lines is the most common technique for reliability enhancement of memories with faults. Faulty cells in memory arrays are known to show spatial locality. This physical phenomenon is referred to as fault clustering . This paper initially investigates a quadrat-based fault model for memory arrays under clustered faults to establish a reliable foundation of measurement. Then, lifelong dependability of a fault-tolerant spaceborne memory system with hierarchical active redundancy, which consists of spare columns in each memory module and redundant memory modules, is measured in terms of the reliability (i.e., the conditional probability that the system performs correctly throughout the mission) and mean-time-to-failure (i.e., the expected time that a system will operate before it fails). Finally, minimal column redundancy search technique for the fault-tolerant memory system is proposed and verified through a series of parametric simulations. Thereby, design and fabrication of cost-effective and highly reliable fault-tolerant onboard mass storage system can be realized for dependable instrumentation.
  • Keywords
    aerospace instrumentation; data integrity; fault tolerant computing; redundancy; storage management chips; data integrity; fault clustering; fault-tolerant spaceborne memory system; faulty cells; harsh cosmic environments; hierarchical active redundancy; mass storage system; mean time to failure; memory array reconfiguration; onboard instrumentation; quadrat-based fault model; redundancy minimization; reliability measurement; spaceborne vehicular technology; spare memory lines; Circuit faults; Fault tolerant systems; Instruments; Lattices; Redundancy; Semiconductor device modeling; Single event transient; Single event upset; Solid state circuits; Vehicular and wireless technologies; Clustered faults; hierarchical active redundancy; mean-time-to-failure (MTTF); memory reconfiguration (repair); onboard mass storage system; quadrat-based fault model; redundancy minimization; reliability;
  • fLanguage
    English
  • Journal_Title
    Instrumentation and Measurement, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9456
  • Type

    jour

  • DOI
    10.1109/TIM.2005.858514
  • Filename
    1542529