DocumentCode :
3305999
Title :
Experimental evaluation of a new distributed partitioning technique for data warehouses
Author :
Bernardino, Jorge ; Madeira, Henrique
Author_Institution :
Inst. Polytech. of Coimbra, Portugal
fYear :
2001
fDate :
2001
Firstpage :
312
Lastpage :
321
Abstract :
Since data warehousing has become a major field of research there has been a lot of interest in reducing the response time of complex queries posed over the very large databases. The problem is that data warehouses store large amounts of data for decision support, requiring a high level of query performance and scalability to the database engines. A novel round-robin data partitioning approach especially designed for relational data warehouse environments is proposed and experimentally evaluated. This approach is specific to data warehouses implemented over relational repositories using the star schema, as it takes advantage of the specific characteristics of star schemas and typical data warehouse query profiles. The proposed approach guarantees optimal load balancing of query execution and assures high scalability. The experimental evaluation presented in the paper, using a comprehensive set of typical queries from the APB-I benchmark running over Oracle 8, shows that an optimal speedup can be obtained with this technique. The proposed technique constitutes an effective and practical way of coping with very large data warehouses and can be applied to existing database technology
Keywords :
data handling; data warehouses; distributed processing; query processing; relational databases; resource allocation; APB-I benchmark; Oracle 8; complex queries; data warehouse query profiles; data warehousing; database engines; database technology; decision support; distributed partitioning technique; optimal load balancing; optimal speedup; query execution; query performance; relational data warehouse environments; relational repositories; response time reduction; round-robin data partitioning approach; scalability; star schema; very large data warehouses; very large databases; Concurrent computing; Costs; Data warehouses; Delay; Distributed computing; Indexes; Query processing; Relational databases; Scalability; Warehousing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database Engineering and Applications, 2001 International Symposium on.
Conference_Location :
Grenoble
Print_ISBN :
0-7695-1140-6
Type :
conf
DOI :
10.1109/IDEAS.2001.938099
Filename :
938099
Link To Document :
بازگشت