• DocumentCode
    1058643
  • Title

    A-GAP: An Adaptive Protocol for Continuous Network Monitoring with Accuracy Objectives

  • Author

    Prieto, Alberto Gonzalez ; Stadler, Rolf

  • Author_Institution
    Sch. of Electr. Eng., KTH R. Inst. of Technol., Stockholm
  • Volume
    4
  • Issue
    1
  • fYear
    2007
  • fDate
    6/1/2007 12:00:00 AM
  • Firstpage
    2
  • Lastpage
    12
  • Abstract
    We present A-GAP, a novel protocol for continuous monitoring of network state variables, which aims at achieving a given monitoring accuracy with minimal overhead. Network state variables are computed from device counters using aggregation functions, such as SUM, AVERAGE and MAX. The accuracy objective is expressed as the average estimation error. A-GAP is decentralized and asynchronous to achieve robustness and scalability. It executes on an overlay that interconnects management processes on the devices. On this overlay, the protocol maintains a spanning tree and updates the network state variables through incremental aggregation. Based on a stochastic model, it dynamically configures local filters that control whether an update is sent towards the root of the tree. We evaluate A-GAP through simulation using real traces and two different types of topologies of up to 650 nodes. The results show that we can effectively control the trade-off between accuracy and protocol overhead, and that the overhead can be reduced by almost two orders of magnitude when allowing for small errors. The protocol quickly adapts to a node failure and exhibits short spikes in the estimation error. Lastly, it can provide an accurate estimate of the error distribution in real-time.
  • Keywords
    error statistics; estimation theory; protocols; stochastic processes; telecommunication network management; telecommunication network reliability; trees (mathematics); adaptive protocol; aggregation function; average estimation error; continuous network monitoring; device counter; estimation error distribution; incremental aggregation; node failure; spanning tree; stochastic model; Computer networks; Counting circuits; Estimation error; Filters; Monitoring; Network topology; Protocols; Robustness; Scalability; Stochastic processes;
  • fLanguage
    English
  • Journal_Title
    Network and Service Management, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1932-4537
  • Type

    jour

  • DOI
    10.1109/TNSM.2007.030101
  • Filename
    4275030