DocumentCode :
1451365
Title :
DEMON: mining and monitoring evolving data
Author :
Ganti, Venkatesh ; Gehrke, Johannes ; Ramakrishnan, Raghu
Author_Institution :
Dept. of Comput. Sci., Wisconsin Univ., Madison, WI, USA
Volume :
13
Issue :
1
fYear :
2001
Firstpage :
50
Lastpage :
63
Abstract :
Data mining algorithms have been the focus of much research. In practice, the input data to a data mining process resides in a large data warehouse whose data is kept up-to-date through periodic or occasional addition and deletion of blocks of data. Most data mining algorithms have either assumed that the input data is static, or have been designed for arbitrary insertions and deletions of data records. We consider a dynamic environment that evolves through systematic addition or deletion of blocks of data. We introduce a new dimension, called the data span dimension, which allows user-defined selections of a temporal subset of the database. Taking this new degree of freedom into account, we describe efficient model maintenance algorithms for frequent item sets and clusters. We then describe a generic algorithm that takes any traditional incremental model maintenance algorithm and transforms it into an algorithm that allows restrictions on the data span dimension. We also develop an algorithm for automatically discovering a specific class of interesting block selection sequences. In a detailed experimental study, we examine the validity and performance of our ideas on synthetic and real datasets
Keywords :
data mining; data warehouses; DEMON; block selection sequences; data addition; data deletion; data mining; data records; data span dimension; evolving data monitoring; experiment; large data warehouse; model maintenance algorithms; Algorithm design and analysis; Clustering algorithms; Data analysis; Data mining; Data warehouses; Deductive databases; Itemsets; Monitoring; Predictive models; Space exploration;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/69.908980
Filename :
908980
Link To Document :
بازگشت