Title :
Optimal Skampling for the Flow Size Distribution
Author :
Veitch, Darryl ; Tune, Paul
Author_Institution :
Dept. of Electr. & Electron. Eng., Univ. of Melbourne, Melbourne, VIC, Australia
Abstract :
We introduce a new method of data collection for flow size estimation, the optimized flow sampling sketch, which combines the optimal properties of flow sampling with the computational advantages of a counter array sketch. Using Fisher information as a definitive basis of comparison, we show that the statistical efficiency of the method is within a constant factor of that of flow sampling, which is known to be optimal but which cannot be implemented without a flow table, which has higher memory and computational costs. In the process, we derive new results on the Fisher information theoretic and variance properties of the counter array sketch, proving that an overloaded sketch actually destroys information. We revisit the `eviction sketch´ of Ribeiro et al. using the Fisher information framework. We show that its performance is much higher than previously supposed, and we define a new method, the optimized eviction sketch, which has very high efficiency. We compare these methods against each other and a third skampling method, sketch guided sampling, theoretically, on models and on data.
Keywords :
Internet; sampling methods; telecommunication traffic; transport protocols; Fisher information; TCP sequence; counter array sketch; data collection; eviction sketch; flow size distribution; flow size estimation; optimized flow sampling sketch; sketch guided sampling; statistical efficiency; variance property; Estimation; Frequency selective surfaces; Radiation detectors; Sampling methods; Size measurement; Vectors; Cram??r-Rao lower bounds; Cramer-Rao lower bounds; Internet; Maximum Likelihood Estimation; internet; maximum likelihood estimation; sampling methods; sketching;
Journal_Title :
Information Theory, IEEE Transactions on
DOI :
10.1109/TIT.2015.2418770