• DocumentCode
    3076227
  • Title

    Assessing Asymmetric Fault-Tolerant Software

  • Author

    Popov, Peter ; Strigini, Lorenzo

  • Author_Institution
    Centre for Software Reliability, City Univ. London, London, UK
  • fYear
    2010
  • fDate
    1-4 Nov. 2010
  • Firstpage
    41
  • Lastpage
    50
  • Abstract
    The most popular forms of fault tolerance against design faults use "asymmetric" architectures in which a "primary" part performs the computation and a "secondary" part is in charge of detecting errors and performing some kind of error processing and recovery. In contrast, the most studied forms of software fault tolerance are "symmetric" ones, e.g. N-version programming. The latter are often controversial, the former are not. We discuss how to assess the dependability gains achieved by these methods. Substantial difficulties have been shown to exist for symmetric schemes, but we show that the same difficulties affect asymmetric schemes. Indeed, the latter present somewhat subtler problems. In both cases, to predict the dependability of the fault-tolerant system it is not enough to know the dependability of the individual components. We extend to asymmetric architectures the style of probabilistic modeling that has been useful for describing the dependability of "symmetric" architectures, to highlight factors that complicate the assessment. In the light of these models, we finally discuss fault injection approaches to estimating coverage factors. We highlight the limits of what can be predicted and some useful research directions towards clarifying and extending the range of situations in which estimates of coverage of fault tolerance mechanisms can be trusted.
  • Keywords
    software fault tolerance; N-version programming; asymmetric fault-tolerant software; error processing; error recovery; fault injection approach; software fault tolerance; Benchmark testing; Computer architecture; Fault tolerance; Fault tolerant systems; Probability distribution; Programming; Software; checker coverage; dependability benchmarking; fault injection; software fault tolerance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Reliability Engineering (ISSRE), 2010 IEEE 21st International Symposium on
  • Conference_Location
    San Jose, CA
  • ISSN
    1071-9458
  • Print_ISBN
    978-1-4244-9056-1
  • Electronic_ISBN
    1071-9458
  • Type

    conf

  • DOI
    10.1109/ISSRE.2010.10
  • Filename
    5635113