Title :
Centralized failure injection for distributed, fault-tolerant protocol testing
Author :
Alvarez, Guillermo A. ; Cristian, Flaviu
Author_Institution :
Dept. of Comput. Sci. & Eng., California Univ., San Diego, La Jolla, CA, USA
Abstract :
We describe a centralized approach to testing that distributed fault-tolerant protocols satisfy their safety and timeliness specifications in the presence of the very failures they are designed to tolerate. CESIUM is a testing environment based on the centralized simulation of distributed executions and failures. Processes are run in a single address space while providing the appearance of a truly distributed execution. The human tester can force the occurrence of arbitrary failures and security attacks. The implementations under test are not instrumented for testing purposes, and their source code need not be available. We prove that CESIUM can execute exactly the set of runs feasible in the real distributed system being simulated. We also show that there are safety and timeliness properties in the specifications of many existing distributed protocols that cannot be tested in practical distributed systems. All of these properties can, however, be accurately tested by CESIUM without introducing any perturbation in test experiments.
Keywords :
fault tolerant computing; formal specification; performance evaluation; protocols; CESIUM; arbitrary failures; centralized failure injection; centralized simulation; distributed protocol testing; fault-tolerant protocol testing; safety; security attacks; single address space; specifications; testing environment; timeliness specifications; truly distributed execution; Clocks; Delay; Fault tolerance; Humans; Instruments; Protocols; Safety; Security; System testing; Vehicle crash testing
Conference_Title :
Distributed Computing Systems, 1997., Proceedings of the 17th International Conference on
Conference_Location :
Baltimore, MD
Print_ISBN :
0-8186-7813-5
DOI :
10.1109/ICDCS.1997.597856