Title :
Toward Optimal Network Fault Correction via End-to-End Inference
Author :
Lee, Patrick P C ; Misra, Vishal ; Rubenstein, Dan
Author_Institution :
Columbia Univ., New York
Abstract :
We consider an end-to-end approach to inferring network faults that manifest in multiple protocol layers, with the optimization goal of minimizing the expected cost of correcting all faulty nodes. Instead of first checking the most likely faulty nodes, as in conventional fault localization problems, we prove that an optimal strategy should start by checking one of the candidate nodes, which are identified using a potential function that we develop. We propose several efficient heuristics for inferring the best node to check in large-scale networks. Through extensive simulation, we show that we can infer the best node in at least 95% of cases, and that checking the candidate nodes first, rather than the most likely faulty nodes, can decrease the checking cost of correcting all faulty nodes by up to 25%.
Keywords :
fault diagnosis; reliability; telecommunication network management; end-to-end inference; fault localization; multiple protocol layers; network diagnosis; optimal network fault correction; reliability engineering; Communications Society; Cost function; Fault diagnosis; Large-scale systems; Monitoring; Network topology; Peer to peer computing; Protocols; Routing; Spine;
Conference_Title :
INFOCOM 2007. 26th IEEE International Conference on Computer Communications. IEEE
Conference_Location :
Anchorage, AK
Print_ISBN :
1-4244-1047-9
DOI :
10.1109/INFCOM.2007.159