DocumentCode
1961689
Title
DEMON: mining and monitoring evolving data
Author
Ganti, Venkatesh ; Gehrke, Johannes ; Ramakrishnan, Raghu
Author_Institution
Wisconsin Univ., Madison, WI, USA
fYear
2000
fDate
2000
Firstpage
439
Lastpage
448
Abstract
Data mining algorithms have been the focus of much research recently. In practice, the input data to a data mining process resides in a large data warehouse whose data is kept up-to-date through periodic or occasional addition and deletion of blocks of data. Most data mining algorithms have either assumed that the input data is static, or have been designed for arbitrary insertions and deletions of data records. We consider a dynamic environment that evolves through systematic addition or deletion of blocks of data. We introduce a new dimension called the data span dimension, which allows user-defined selections of a temporal subset of the database. Taking this new degree of freedom into account, we describe efficient model maintenance algorithms for frequent itemsets and clusters. We then describe a generic algorithm that takes any traditional incremental model maintenance algorithm and transforms it into an algorithm that allows restrictions on the data span dimension. In a detailed experimental study, we examine the validity and performance of our ideas
Keywords
data mining; data warehouses; software performance evaluation; temporal databases; DEMON; data addition; data deletion; data mining; data span dimension; evolving data monitoring; experimental study; incremental model maintenance algorithm; large data warehouse; temporal database; Algorithm design and analysis; Data analysis; Data mining; Data warehouses; Databases; Itemsets; Monitoring; Nominations and elections; Predictive models;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2000. Proceedings. 16th International Conference on
Conference_Location
San Diego, CA
ISSN
1063-6382
Print_ISBN
0-7695-0506-6
Type
conf
DOI
10.1109/ICDE.2000.839443
Filename
839443
Link To Document