Title :
Frequent itemsets mining for database auto-administration
Author :
Aouiche, Kamel ; Darmont, Jérôme ; Gruenwald, Le
Author_Institution :
Univ. of Lyon 2, France
Abstract :
With the wide development of databases in general and data warehouses in particular, it is important to reduce the tasks that a database administrator must perform manually. The aim of auto-administrative systems is to administer and adapt themselves automatically without loss (or even with a gain) in performance. The idea of using data mining techniques to extract useful knowledge for administration from the data themselves has existed for some years. However, little research has been carried out. This idea nevertheless remains a very promising approach, notably in the field of data warehousing, where queries are very heterogeneous and cannot be interpreted easily. The aim of this study is to search for a way of extracting useful knowledge from the stored data themselves in order to automatically apply performance optimization techniques, and more particularly indexing techniques. We have designed a tool that extracts frequent itemsets from a given workload to compute an index configuration that helps optimize data access time. The experiments we performed showed that the index configurations generated by our tool allowed performance gains of 15% to 25% on a test database and a test data warehouse.
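The general idea of mining a workload for frequent itemsets can be illustrated with a minimal sketch. It assumes that each query in the workload has already been reduced to the set of attributes it references; the abstract does not say which mining algorithm the tool uses, so the brute-force enumeration below is only a stand-in, and the function name, attribute names, and support threshold are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Return {itemset: support} for every itemset whose support is at
    least min_support, where support is the fraction of transactions
    that contain the itemset.

    Brute-force enumeration: fine for a small illustration, but
    exponential in the number of attributes per query.
    """
    n = len(transactions)
    counts = Counter()
    for transaction in transactions:
        items = sorted(set(transaction))
        # Count every non-empty subset of the attributes in this query.
        for size in range(1, len(items) + 1):
            for combo in combinations(items, size):
                counts[frozenset(combo)] += 1
    return {s: c / n for s, c in counts.items() if c / n >= min_support}


# Hypothetical workload: one set of referenced attributes per query.
workload = [
    {"customer.city", "sales.date"},
    {"customer.city", "sales.date"},
    {"customer.city"},
    {"product.category"},
]

# Each frequent itemset is a candidate index (single- or multi-attribute).
candidates = frequent_itemsets(workload, min_support=0.5)
for attrs, support in sorted(candidates.items(), key=lambda kv: -kv[1]):
    print(sorted(attrs), support)
```

In this sketch every frequent itemset is treated as a candidate index; how the actual tool selects a final index configuration from such candidates is not described in the abstract.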
Keywords :
data mining; data reduction; data warehouses; database management systems; indexing; optimisation; DBA; adaptive system; autoadministrative system; data access time optimization; data mining; data warehouse; database autoadministration; database management system; heterogeneous query; index configuration; index utility; indexing; itemset frequency; itemset mining; knowledge extraction; performance gain; performance optimization technique; stored data; task reduction; transaction log file parsing; usage frequency; Data mining; Data warehouses; Databases; Indexing; Itemsets; Optimization; Performance gain; Performance loss; Testing; Warehousing;
Conference_Title :
Database Engineering and Applications Symposium, 2003. Proceedings. Seventh International
Print_ISBN :
0-7695-1981-4
DOI :
10.1109/IDEAS.2003.1214915