Title :
A benchmark for fault monitors in distributed systems
Author :
Hussain, Shujaat ; Qadir, Muhammad Abdul
Author_Institution :
Center for Distrib. & Semantic Comput., Mohammad Ali Jinnah Univ., Islamabad, Pakistan
Abstract :
Fault monitoring is one of the main activities of fault tolerant distributed systems. It is required to determine the suspected/crashed component and proactively take the recovery steps to keep the system alive. The main objective of the fault monitoring activity is to quickly and correctly identify the faults. There are many techniques for fault monitoring which have general and specific parameters which influence their performance. In this paper we find the parameters that can help us classify the fault monitoring techniques. We created a benchmark ACI (adaptation, convergence, intelligence) and applied it on current techniques.
Keywords :
benchmark testing; distributed processing; software fault tolerance; system monitoring; adaptation benchmark; convergence benchmark; fault monitoring; fault tolerant distributed systems; intelligence benchmark; Computer crashes; Condition monitoring; Convergence; Delay effects; Detectors; Distributed computing; Fault detection; Fault diagnosis; Fault tolerant systems; Remote monitoring; adaptation; benchmark; fault detectors; fault monitoring; timeout;
Conference_Titel :
Emerging Technologies, 2009. ICET 2009. International Conference on
Conference_Location :
Islamabad
Print_ISBN :
978-1-4244-5630-7
Electronic_ISBN :
978-1-4244-5631-4
DOI :
10.1109/ICET.2009.5353193