DocumentCode
2848366
Title
Maintaining Implicated Statistics in Constrained Environments
Author
Sismanis, Yannis ; Roussopoulos, Nick
Author_Institution
IBM Almaden Res. Center, San Jose, CA, USA
fYear
2005
fDate
05-08 April 2005
Firstpage
730
Lastpage
741
Abstract
Aggregated information regarding implicated entities is critical for online applications like network management, traffic characterization or identifying patters of resource consumption. Recently there has been a flurry of research for online aggregation on streams (like quantiles, hot items, hierarchical heavy hitters) but surprizingly the problem of summarizing implicated information in stream data has received no attention. As an example, consider an IP-network and the implication source → destination. Flash crowds, — such as those that follow recent sport events (like the olympics) or seek information regarding catastrophic events — or denial of service attacks direct a large volume of traffic from a huge number of sources to a very small number of destinations. In this paper we present novel randomized algorithms for monitoring such implications with constraints in both memory and processing power for environments like network routers. Our experiments demonstrate several factors of improvements over straightforward approaches.
Keywords
data integrity; database management systems; randomised algorithms; statistical analysis; constrained environments; implicated statistics maintenance; network routers; online aggregation; randomized algorithms; Aggregates; Association rules; Collaborative work; Frequency; Government; Itemsets; Monitoring; Statistics; Telecommunication traffic; World Wide Web;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2005. ICDE 2005. Proceedings. 21st International Conference on
ISSN
1084-4627
Print_ISBN
0-7695-2285-8
Type
conf
DOI
10.1109/ICDE.2005.84
Filename
1410188
Link To Document