DocumentCode :
2132202
Title :
Optimal deadlock detection in distributed systems based on locally constructed wait-for graphs
Author :
Chen, Shigang ; Deng, Yi ; Attie, Paul ; Sun, Wei
Author_Institution :
Dept. of Comput. Sci., Illinois Univ., Urbana, IL, USA
fYear :
1996
fDate :
27-30 May 1996
Firstpage :
613
Lastpage :
619
Abstract :
We present a new algorithm for detecting generalized deadlocks in distributed systems. Our algorithm incrementally constructs and reduces a wait-for graph (WFG) at an initiator process. This WFG is then searched for deadlock. The proposed algorithm has two primary advantages: First, it avoids sending messages along the edges of the global wait-for graph (WFG), thereby achieving a worst-case message complexity of 2n, where n is the number of processes in the WFG. Since information must be obtained from every process reachable from the initiator, this is optimal to within a constant factor. All the existing algorithms for the same problem construct a distributed snapshot of the WFG. As this involves sending messages along the edges of the WFG, the best available message complexity among these algorithms is 4e-2n+2l, which is O(n2) in the worst case, where e and l are the number of edges and leaves in the WFG, respectively. Second, since the information about a detected deadlock is readily available at the initiator process, rather than distributed among different processes, it significantly simplifies the task of deadlock resolution, and helps to reduce system overhead associated with the resolution. The time complexity of our algorithm is also better than or equal to the existing algorithms
Keywords :
communication complexity; computational complexity; concurrency control; distributed processing; network operating systems; deadlock resolution; distributed snapshot; distributed systems; generalized deadlocks; locally constructed wait-for graphs; optimal deadlock detection; system overhead; time complexity; worst-case message complexity; Computer science; Contracts; Detection algorithms; Laboratories; NASA; Sun; System recovery; Topology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Distributed Computing Systems, 1996., Proceedings of the 16th International Conference on
Print_ISBN :
0-8186-7399-0
Type :
conf
DOI :
10.1109/ICDCS.1996.508012
Filename :
508012
Link To Document :
بازگشت