DocumentCode :
577463
Title :
A study on the selection of bitmap join index using data mining techniques
Author :
An, Hyoung Geun ; Koh, Jae Jin
Author_Institution :
School of Electrical Engineering, University of Ulsan 680-749 San 29, Mugeo2-Dong, Nam-Gu, Ulsan Korea
fYear :
2012
fDate :
18-21 Sept. 2012
Firstpage :
1
Lastpage :
5
Abstract :
The problem of indices selection on the large size data warehouses affects the efficiency of the query processing of the data warehouses. The lower query processing costs can be obtained by using indices, but the indices occupy the large storage areas and induce the index maintenance costs which are accompanied by database updates. By using the bitmap join indices, we can optimize the star join queries which join a fact table and many dimension tables, and do the selection on dimension tables in data warehouses. Although the bitmap join index takes the lower storage costs, the task to select the indexing attributes among the huge candidate attributes is difficult. In this paper we reduce the number of candidate attributes for bitmap join index by the data mining techniques. Compared to the existing techniques which reduce the number of candidate attributes by the frequencies of attributes, we consider the frequencies of attributes and the size of dimmension tables and the size of the tuples of the dimension tables and the page size of disk. We reduce the great number of candidate attributes by using the frequent itemsets mining techniques. We make the bitmap join indices which have the least costs and the least storage areas adapted to storage constraints by using the cost functions applied to the bitmap join indices of the candidate attributes. We compare the existing techniques and ours, and analyze them in order to evaluate the efficiencies of ours.
Keywords :
cost reduction; data mining; data warehouses; disc storage; indexing; optimisation; query processing; relational databases; storage management; bitmap join index selection; candidate attributes; cost functions; data mining techniques; dimension tables; disk page size; indexing attributes; large size data warehouses; query processing costs; star join query optimization; storage areas; storage costs; Data mining; Data warehouses; Indexes; Itemsets; Marketing and sales; Query processing; Bitmap join index; data mining; frequent itemsets; relational data warehouse;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Strategic Technology (IFOST), 2012 7th International Forum on
Conference_Location :
Tomsk
Print_ISBN :
978-1-4673-1772-6
Type :
conf
DOI :
10.1109/IFOST.2012.6357667
Filename :
6357667
Link To Document :
بازگشت