Title :
Adaptive distributed sorting
Author :
Alari, Gianluigi ; Bourgon, Brian ; Chacko, Joseph ; Datta, Ajoy Kumar
Author_Institution :
Unite d´´Inf., Katholieke Univ., Leuven, Belgium
Abstract :
The paper presents a fault tolerant distributed sorting algorithm designed to operate in a network with a tree topology subject to transient faults, crashes and joins of both links and nodes. The distributed sorting problem can be informally described as follows: nodes cooperate to reach a global configuration, where every node, depending on its id, is assigned a specific final value taken from the set of the original values across all nodes. Each node has a initial non-corruptible id and value. Fault tolerance is achieved using Dijkstra´s paradigm of self-stabilization. A self-stabilizing algorithm, regardless of the initial system state, will converge in finite time to a set of safe legitimate states through embedding fault tolerance without the need to recur to explicit exception handlers or backward recovery. Our solution is based on continuous information flow to provide each node with a consistent view of the system state. It requires the knowledge of n, the number of nodes in the system, in order to correctly stabilize and it has O(d) time complexity and O(n) memory requirement where d is the diameter of the tree network. The algorithm handles transient faults by weeding out false information in the system. Nodes can start with completely false information concerning the values, ids and final values in the system, yet the intended behavior is still achieved. Also, nodes and links are allowed to crash, provided the network remains connected. Additionally, new nodes and links can join at any time
Keywords :
adaptive systems; computational complexity; distributed algorithms; software fault tolerance; sorting; adaptive distributed sorting; consistent view; continuous information flow; distributed sorting problem; fault tolerant distributed sorting algorithm; global configuration; initial non corruptible id; memory requirement; safe legitimate states; self stabilizing algorithm; time complexity; transient faults; tree network; tree topology; Algorithm design and analysis; Computer crashes; Convergence; Counting circuits; Fault tolerance; Fault tolerant systems; Hardware; Intrusion detection; Software algorithms; Sorting;
Conference_Titel :
Computers and Communications, 1996., Conference Proceedings of the 1996 IEEE Fifteenth Annual International Phoenix Conference on
Conference_Location :
Scottsdale, AZ
Print_ISBN :
0-7803-3255-5
DOI :
10.1109/PCCC.1996.493605