• DocumentCode
    3806513
  • Title
    Effect of System Processes on Error Repetition: A Probabilistic and Measurement Approach
  • Author
    Anna Hac
  • Author_Institution
    The Johns Hopkins University, Baltimore
  • Volume
    35
  • Issue
    5
  • fYear
    1986
  • Firstpage
    494
  • Lastpage
    497
  • Abstract
    This paper presents an approach to system reliability modeling in which failures and errors are not statistically independent. The repetition of failures and errors until their causes are removed is affected by the system processes and degrades system reliability. Four types of failures are introduced: hardware transients, software and hardware design errors, and program faults. The probability of failure, the mean time to failure, and system reliability depend on the type of failure. Actual measurements show that the most critical factor for system reliability is the time after the occurrence of a failure during which this failure can be repeated in every process that accesses the failed component. An example involving measurements collected in an IBM 4331 installation validates the model and shows its applications. The degradation of system reliability can be appreciable even for very short periods of time. This is why the conditional probability of repetition of failures is introduced. The reliability model allows prediction of system reliability based on the calculation of the mean time to failure. The comparison with the measurement results shows that the model with process-dependent repetition of failures approximates system reliability with better accuracy than the model that assumes independent failures.
  • Keywords
    "Reliability","Degradation","Failure analysis","Hardware","Probability","Time measurement","Predictive models","Databases","Computer errors","Software design"
  • Journal_Title
    IEEE Transactions on Reliability
  • Publisher
    IEEE
  • ISSN
    0018-9529
  • Type
    jour
  • DOI
    10.1109/TR.1986.4335527
  • Filename
    4335527