DocumentCode :
2975871
Title :
Detecting and localizing large-scale router failures using active probes
Author :
Zheng, Qiang ; Cao, Guohong ; La Porta, Tom ; Swami, Ananthram
Author_Institution :
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
fYear :
2011
fDate :
7-10 Nov. 2011
Firstpage :
1170
Lastpage :
1175
Abstract :
Detecting the occurrence of large-scale router failures and localizing the failed routers are critical to enhancing network reliability. We propose a two-phase approach for detecting and localizing large-scale router failures using traceroute-like active probes. To detect large-scale router failures, the detection phase is periodically invoked to probe all routers. When detecting large-scale router failures, the localization phase is triggered to identify the failed routers.We reduce the probing cost by avoiding three types of useless probes. For the routers whose status cannot be identified by probes, we develop a distance based method to estimate their failure probability. Experimental results based on ISP topologies show that the accuracy of our approach is higher than 96.5%, even when only 10% of routers are connected by end systems for probing. Compared with prior works, the proposed approach achieves much higher accuracy with lower probing cost.
Keywords :
computer network reliability; probability; telecommunication network routing; detection phase; failure probability; large-scale router failure detection; large-scale router failure localization; network reliability; traceroute-like active probes; two-phase approach; Accuracy; IP networks; Probability; Probes; Reliability; Routing; Topology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
MILITARY COMMUNICATIONS CONFERENCE, 2011 - MILCOM 2011
Conference_Location :
Baltimore, MD
ISSN :
2155-7578
Print_ISBN :
978-1-4673-0079-7
Type :
conf
DOI :
10.1109/MILCOM.2011.6127458
Filename :
6127458
Link To Document :
بازگشت