Title :
Discovering Flow Anomalies: A SWEET Approach
Author :
Kang, James M. ; Shekhar, Shashi ; Wennen, Christine ; Novak, Paige
Author_Institution :
Dept. of Comput. Sci., Univ. of Minnesota, Minneapolis, MN
Abstract :
Given a percentage-threshold and readings from a pair of consecutive upstream and downstream sensors, flow anomaly discovery identifies dominant time intervals where the fraction of time instants of significantly mis-matched sensor readings exceed the given percentage-threshold. Discovering flow anomalies (FA) is an important problem in environmental flow monitoring networks and early warning detection systems for water quality problems. However, mining FAs is computationally expensive because of the large (potentially infinite) number of time instants of measurement and potentially long delays due to stagnant (e.g. lakes) or slow moving (e.g. wetland) water bodies between consecutive sensors. Traditional outlier detection methods (e.g. t-test) are suited for detecting transient FAs (i.e., time instants of significant mis-matches across consecutive sensors) and cannot detect persistent FAs (i.e., long variable time-windows with a high fraction of time instant transient FAs) due to a lack of a pre-defined window size. In contrast, we propose a Smart Window Enumeration and Evaluation of persistence-Thresholds (SWEET) method to efficiently explore the search space of all possible window lengths. Computation overhead is brought down significantly by restricting the start and end points of a window to coincide with transient FAs, using a smart counter and efficient pruning techniques. Experimental evaluation using a real dataset shows our proposed approach outperforms Nainodotve alternatives.
Keywords :
condition monitoring; data mining; environmental science computing; water quality; early warning detection systems; environmental flow monitoring networks; flow anomaly discovery; pruning techniques; real dataset; search space; smart counter; smart window enumeration and evaluation of persistence-thresholds method; water bodies; water quality problems; Birth disorders; Condition monitoring; Contamination; Lakes; Petroleum; Pollution measurement; Rivers; Sensor phenomena and characterization; Water pollution; Water resources;
Conference_Titel :
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location :
Pisa
Print_ISBN :
978-0-7695-3502-9
DOI :
10.1109/ICDM.2008.117