Title :
Scalable Failure Management for Peer-to-Peer Networks
Author :
Catalin Leordeanu;Vlad Calina;Valentin Cristea
fDate :
7/1/2012 12:00:00 AM
Abstract :
Failure management is a key component in the attempt to provide a reliable environment. This article proposes a solution to increase the reliability of distributed systems based on the Chord Peer-to-Peer overlay. our solution is aimed at providing accurate failure information about the nodes in the system. This is a very difficult task in Peer-to-peer networks due to their dynamic nature and the inability to obtain reliable data from failure detectors. We propose a failure history service used to share failure information between peer-to-peer nodes. This novel service ensures that the information about the current state of a node, as well as its failure history, is as accurate as possible even when facing a large number of node failures. This solution aims to increase the reliability of distributed systems based on the Chord peer-to-peer overlay by providing accurate data which can be used to analyze failures over time.
Keywords :
"Peer to peer computing","Detectors","History","Monitoring","Protocols","Software reliability"
Conference_Titel :
Complex, Intelligent and Software Intensive Systems (CISIS), 2012 Sixth International Conference on
Print_ISBN :
978-1-4673-1233-2
DOI :
10.1109/CISIS.2012.193