Title :
Forgetting data intelligently in data warehouses
Author :
Boly, A. ; Hebrail, G.
Author_Institution :
Lab. LTCI - UMR 5141 CNRS, Ecole Nat. Super. des Telecommun. Paris, Paris
Abstract :
The amount of data stored in data warehouses grows very quickly so that they can get saturated. To overcome this problem, we propose a language for specifying forgetting functions on stored data. In order to preserve the possibility of performing interesting analyses of historical data, the specifications include the definition of some summaries of deleted data. These summaries are aggregates and samples of deleted data and will be kept in the data warehouse. Once forgetting functions have been specified, the data warehouse is automatically updated in order to follow the specifications. This paper presents both the language for specifications, the structure of the summaries and the algorithms to update the data warehouse.
Keywords :
data warehouses; specification languages; data aggregation; data warehouse; forgetting function specification language; historical data analysis; Aggregates; Algorithm design and analysis; Costs; Data analysis; Data warehouses; Performance analysis; Relational databases; Research and development; Sampling methods; Specification languages; Aggregation; Data cubes; Data warehouses; Forgetting functions;
Conference_Titel :
Research, Innovation and Vision for the Future, 2007 IEEE International Conference on
Conference_Location :
Hanoi
Print_ISBN :
1-4244-0694-3
DOI :
10.1109/RIVF.2007.369160