DocumentCode :
751718
Title :
Toward Optimal Network Fault Correction in Externally Managed Overlay Networks
Author :
Lee, Patrick P C ; Misra, Vishal ; Rubenstein, Dan
Author_Institution :
Dept. of Comput. Sci. & Eng., Chinese Univ. of Hong Kong, Shatin, China
Volume :
21
Issue :
3
fYear :
2010
fDate :
3/1/2010
Firstpage :
354
Lastpage :
366
Abstract :
We consider an end-to-end approach to inferring probabilistic data forwarding failures in an externally managed overlay network, where overlay nodes are independently operated by various administrative domains. Our optimization goal is to minimize the expected cost of correcting (i.e., diagnosing and repairing) all faulty overlay nodes that cannot properly deliver data. Instead of first checking the most likely faulty nodes as in conventional fault localization problems, we prove that an optimal strategy should start with checking one of the candidate nodes, which are identified based on a potential function that we develop. We propose several efficient heuristics for inferring the best node to be checked in large-scale networks. By extensive simulation, we show that we can infer the best node at least 95 percent of the time, and that first checking the candidate nodes rather than the most likely faulty nodes can decrease the checking cost of correcting all faulty nodes.
Keywords :
computer network management; fault diagnosis; telecommunication network reliability; externally managed overlay networks; large-scale networks; optimal network fault correction; Network management; fault localization and repair; network diagnosis and correction; reliability engineering
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
IEEE
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/TPDS.2009.66
Filename :
4840335