• DocumentCode
    3177941
  • Title

    An object-oriented testbed for the evaluation of checkpointing and recovery systems

  • Author

    Ramamurthy, B. ; Upadhyaya, S.J. ; Iyer, R.K.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., State Univ. of New York, Buffalo, NY, USA
  • fYear
    1997
  • fDate
    24-27 June 1997
  • Firstpage
    194
  • Lastpage
    203
  • Abstract
    The paper presents the design and development of an object-oriented testbed for simulation and analysis of checkpointing and recovery schemes in distributed systems. An important contribution, of the testbed is a unified environment that provides a set of specialized components for easy and detailed simulation of checkpointing and recovery schemes. The testbed allows a designer to mix and match different components either to study the effectiveness of a particular scheme or to freely experiment with hybrid designs before the actual implementation. The testbed also facilitates the evaluation of interdependencies among the various parameters such as communication and application dynamics and their effect on the performance of checkpointing and recovery schemes. The implementation of the testbed as an extension of DEPEND which is an integrated design and fault-injection environment, provides for unique system-level dependability analysis under realistic fault conditions unlike existing simulation tools. The authors illustrate the versatility of the testbed by using four diverse applications, ranging from the comparison of performances of two checkpointing and recovery schemes to the study of the effect of checkpoint size.
  • Keywords
    distributed processing; object-oriented methods; object-oriented programming; software performance evaluation; system recovery; testing; virtual machines; application dynamics; checkpoint size effect; checkpointing system evaluation; communication dynamics; distributed systems; extended DEPEND; hybrid designs; integrated design/fault-injection; object-oriented testbed; parameter interdependencies; performances; realistic fault conditions; recovery system evaluation; simulation; specialized components; system-level dependability analysis; Analytical models; Application software; Checkpointing; Computational modeling; Computer simulation; Design engineering; Distributed computing; Object oriented modeling; Software testing; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fault-Tolerant Computing, 1997. FTCS-27. Digest of Papers., Twenty-Seventh Annual International Symposium on
  • Conference_Location
    Seattle, WA, USA
  • ISSN
    0731-3071
  • Print_ISBN
    0-8186-7831-3
  • Type

    conf

  • DOI
    10.1109/FTCS.1997.614092
  • Filename
    614092