Title :
Understanding Data Completeness in Network Monitoring Systems
Author :
Korn, Flip ; Ruilin Liu ; Hui Wang
Author_Institution :
AT&T Labs.-Res., Florham Park, NJ, USA
Abstract :
In many networks including Internet Service Providers, transportation monitoring systems and the electric grid, measurements from a set of objects are continuously taken over time and used for important decisions such as provisioning and pricing. It is therefore vital to understand the completeness and reliability of such data. Given the large volume of data generated by these systems, rather than enumerating the times and objects incurring missing or spurious data, it is more effective to provide patterns (groups of tuples) concisely summarizing trends that may not otherwise be readily observable. In this paper, we define the Graph Tableau Discovery Problem where the measured tuples can be thought of as edges in a bipartite graph on an ordered attribute (time) and an unordered attribute (object identifiers). We define the problem of finding an optimal summary, show that it is NP-complete, and then provide a polynomial-time approximation algorithm with guarantees to find a good summary. Experiments on real and synthetic data demonstrate the effectiveness and efficiency of our approach.
Keywords :
computational complexity; data handling; graph theory; polynomial approximation; Internet service providers; NP-complete; data completeness; electric grid; graph tableau discovery problem; network monitoring systems; polynomial-time approximation algorithm; pricing; transportation monitoring systems; Approximation algorithms; Approximation methods; Greedy algorithms; Loss measurement; Monitoring; Silicon; Time measurement;
Conference_Titel :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
Print_ISBN :
978-1-4673-4649-8
DOI :
10.1109/ICDM.2012.149