• DocumentCode
    3565718
  • Title

    Modeling conservative updates in multi-hash approximate count sketches

  • Author

    Bianchi, Giuseppe ; Duffy, Ken ; Leith, Douglas ; Shneer, Vsevolod

  • Author_Institution
    CNIT, Univ. Roma Tor Vergata, Rome, Italy
  • fYear
    2012
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Multi-hash-based count sketches are fast and memory efficient probabilistic data structures that are widely used in scalable online traffic monitoring applications. Their accuracy significantly improves with an optimization, called conservative update, which is especially effective when the aim is to discriminate a relatively small number of heavy hitters in a traffic stream consisting of an extremely large number of flows. Despite its widespread application, a thorough understanding of the conservative update operation has lagged behind, perhaps because of the significant modeling complexity involved. In this work we attempt to fill this gap. Our proposed modeling approach builds on a practically important empirical finding: simulation results (as well as experimental ones over real traffic traces) obtained for skewed load scenarios exhibit a sharp waterfall-type behaviour. That is, the approximate count provided by the sketch response remains accurate until an “error floor” is reached. Flows below this error flow level are on average approximated by the same error floor count value, irrespective of their exact count. The error floor itself appears to be maximal in the case of uniform load. Leveraging the simplifications made possible when the load is uniform, we derive an analytic model capable of accurately predicting the transient growth behavior of the (tightly correlated) counters deployed in the data structure and obtain an upper bound on the error floor level.
  • Keywords
    cryptography; data models; data structures; optimisation; probability; conservative update modelling; error floor count value; error flow level; memory efficient probabilistic data structures; multihash approximate count sketches; scalable online traffic monitoring; traffic stream; transient growth behavior; upper bound; waterfall-type behaviour; Accuracy; Analytical models; Equations; Load modeling; Mathematical model; Optimization; Radiation detectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Teletraffic Congress (ITC 24), 2012 24th International
  • Print_ISBN
    978-1-4673-1292-9
  • Electronic_ISBN
    978-0-9836283-3-0
  • Type

    conf

  • Filename
    6331813