Title :
FAST Failure Detection Service for Large Scale Distributed Systems
Author :
Kalewski, M. ; Kobusinska, Anna ; Kobusinski, J.
Author_Institution :
Inst. of Comput. Sci., Poznan Univ. of Technol., Poznan
Abstract :
This paper addresses the problem of building a failure detection service for large scale distributed systems. We describe failure detection service, which merges some novel proposals and satisfies scalability, flexibility and adaptability properties. Afterwards, we present the architecture of such a service, show detailed information about its components and present the simulation results concerning performance.
Keywords :
distributed processing; failure analysis; software reliability; distributed computing; fast failure detection service; large scale distributed systems; Costs; Delay; Detectors; Distributed computing; Large-scale systems; Network topology; Peer to peer computing; Proposals; Scalability; Telecommunication traffic; distributed systems; failure detection; fault tolerance; large scale systems; probabilistic protocols;
Conference_Titel :
Parallel, Distributed and Network-based Processing, 2009 17th Euromicro International Conference on
Conference_Location :
Weimar
Print_ISBN :
978-0-7695-3544-9
DOI :
10.1109/PDP.2009.33