DocumentCode :
1081468
Title :
Improved task-allocation algorithms to maximize reliability of redundant distributed computing systems
Author :
Kartik, S. ; Murthy, C. Siva Ram
Author_Institution :
Wipro Infotech Ltd., Bangalore, India
Volume :
44
Issue :
4
fYear :
1995
fDate :
12/1/1995 12:00:00 AM
Firstpage :
575
Lastpage :
586
Abstract :
This paper considers the problem of finding an optimal and sub-optimal task-allocation (assign the processing node for each module of a task or program) in redundant distributed computing systems, with the goal of maximizing system-reliability (probability that the system completes the entire task successfully). Finding an optimal task-allocation is NP-hard in the strong sense. An efficient algorithm is presented for optimal task-allocation in a distributed computing system with level-2 redundancy. The algorithm, (a) uses branch and bound with underestimates for reducing the average time complexity of optimal task-allocation computations, and (b) reorders the list of modules to allow a subset of modules that does not intra-communicate to be assigned last, for further reduction in the computations of optimal task-allocation for maximizing reliability. An efficient heuristics algorithm is given which obtains sub-optimal solutions in a reasonable computation time. The performance of our algorithms is given over a wide range of parameters such as number of modules, number of processing nodes, ratio of average execution cost to average communication cost, and connectivity of modules. The effectiveness of the optimal task-allocation algorithm is demonstrated by comparing it to a competing optimal task-allocation algorithm, for maximizing reliability. The performance of our algorithm improves very much when the difference between the number of modules and the connectivity increases
Keywords :
distributed processing; fault tolerant computing; redundancy; reliability; resource allocation; task analysis; average execution/communication cost ratio; average time complexity reduction; branch and bound; heuristics algorithm; level-2 redundancy; module connectivity; optimal task-allocation; redundant distributed computing systems; reliability maximisation; search tree; sub-optimal task-allocation; task-allocation algorithms; Cost function; Distributed computing; Fault tolerant systems; Heuristic algorithms; Multiprocessor interconnection networks; Real time systems; Telecommunication network reliability; Telephony;
fLanguage :
English
Journal_Title :
Reliability, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9529
Type :
jour
DOI :
10.1109/24.475976
Filename :
475976
Link To Document :
بازگشت