DocumentCode :
610393
Title :
Finding interesting correlations with conditional heavy hitters
Author :
Mirylenka, K.; Palpanas, T.; Cormode, G.; Srivastava, Divesh
Author_Institution :
Univ. of Trento, Trento, Italy
fYear :
2013
fDate :
8-12 April 2013
Firstpage :
1069
Lastpage :
1080
Abstract :
The notion of heavy hitters (items that make up a large fraction of the population) has been successfully used in a variety of applications across sensor and RFID monitoring, network data analysis, event mining, and more. Yet this notion often fails to capture the semantics we desire when we observe data in the form of correlated pairs. Here, we are interested in items that are conditionally frequent: when a particular item is frequent within the context of its parent item. In this work, we introduce and formalize the notion of Conditional Heavy Hitters to identify such items, with applications in network monitoring and Markov chain modeling. We introduce several streaming algorithms that allow us to find conditional heavy hitters efficiently, and provide analytical results. Different algorithms are successful for different input characteristics. We perform experimental evaluations to demonstrate the efficacy of our methods, and to study which algorithms are most suited for different types of data.
Keywords :
Markov processes; data handling; Markov chain modeling; RFID monitoring; conditional heavy hitters; event mining; finding interesting correlations; network data analysis; network monitoring; observe data; sensor monitoring; streaming algorithms; Correlation; Data models; Data structures; Frequency estimation; Itemsets; Monitoring
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Data Engineering (ICDE), 2013 IEEE 29th International Conference on
Conference_Location :
Brisbane, QLD
ISSN :
1063-6382
Print_ISBN :
978-1-4673-4909-3
Electronic_ISBN :
1063-6382
Type :
conf
DOI :
10.1109/ICDE.2013.6544898
Filename :
6544898