DocumentCode :
3165204
Title :
Dynamic Pattern Detection with Temporal Consistency and Connectivity Constraints
Author :
Speakman, Skyler ; Yating Zhang ; Neill, Daniel B.
Author_Institution :
Event & Pattern Detection Lab., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
2013
fDate :
7-10 Dec. 2013
Firstpage :
697
Lastpage :
706
Abstract :
We explore scalable and accurate dynamic pattern detection methods in graph-based data sets. We apply our proposed Dynamic Subset Scan method to the task of detecting, tracking, and source-tracing contaminant plumes spreading through a water distribution system equipped with noisy, binary sensors. While static patterns affect the same subset of data over a period of time, dynamic patterns may affect different subsets of the data at each time step. These dynamic patterns require a new approach to define and optimize penalized likelihood ratio statistics in the subset scan framework, as well as new computational techniques that scale to large, real-world networks. To address the first concern, we develop new subset scan methods that allow the detected subset of nodes to change over time, while incorporating temporal consistency constraints to reward patterns that do not dramatically change between adjacent time steps. Second, our Additive Graph Scan algorithm allows our novel scan statistic to process small graphs (500 nodes) in 4.1 seconds on average while maintaining an approximation ratio over 99% compared to an exact optimization method, and to scale to large graphs with over 12,000 nodes in 30 minutes on average. Evaluation results across multiple detection, tracking, and source-tracing tasks demonstrate substantial performance gains achieved by the Dynamic Subset Scan approach.
Keywords :
contamination; data mining; environmental science computing; graph theory; sensor fusion; statistical analysis; additive graph scan algorithm; approximation ratio; connectivity constraints; dynamic pattern detection method; dynamic subset scan method; exact optimization method; graph-based data sets; multiple detection; noisy binary sensors; penalized likelihood ratio statistics; sensor fusion; source-tracing contaminant plumes; static patterns; temporal consistency; temporal consistency constraints; water distribution system; Additives; Equations; Heuristic algorithms; Mathematical model; Optimization; Sensor systems; likelihood ratio statistics; sensor fusion; spatial and subset scan statistics; water distribution systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2013 IEEE 13th International Conference on
Conference_Location :
Dallas, TX
ISSN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2013.66
Filename :
6729554
Link To Document :
بازگشت