DocumentCode :
3086141
Title :
Effective Summarization of Multi-Dimensional Data Streams for Historical Stream Mining
Author :
Nassar, Samer ; Sander, Joerg
Author_Institution :
Alberta Univ., Edmonton
fYear :
2007
fDate :
9-11 July 2007
Firstpage :
30
Lastpage :
30
Abstract :
We consider the following problem: given a very large data stream, a limited space to encode the stream, and a compression technique to compress the stream, retain the most important information from the distant past of the stream while at the same time retain high quality of the compressed information that is in the recent part of the stream to perform temporal analysis of the summarized information. Simple schemes for accumulating micro-clustering summaries of stream windows that have been previously proposed are very ineffective for solving this challenging task. We overcome the limitations of these schemes by first identifying spatial summaries that compress "similar\´ regions in the data space, and reduce their space consumption using novel approximate spatio-temporal summaries. Second, we present policies for effectively utilizing the space budget and managing these novel approximate spatio-temporal summaries.
Keywords :
data analysis; data compression; data mining; compressed information; historical stream mining; multidimensional data streams; summarization; temporal analysis; Data mining; Extraterrestrial measurements; Financial management; High performance computing; History; Information analysis; Multidimensional systems; Pattern analysis; Performance analysis; Quality management;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Scientific and Statistical Database Management, 2007. SSBDM '07. 19th International Conference on
Conference_Location :
Banff, Alta.
ISSN :
1551-6393
Print_ISBN :
0-7695-2868-6
Electronic_ISBN :
1551-6393
Type :
conf
DOI :
10.1109/SSDBM.2007.32
Filename :
4274975
Link To Document :
بازگشت