DocumentCode
2507334
Title
Automating layout of relational databases
Author
Agrawal, Sanjay ; Chaudhuri, Surajit ; Das, Abhinandan ; Narasayya, Vivek
Author_Institution
Microsoft Res., USA
fYear
2003
fDate
5-8 March 2003
Firstpage
607
Lastpage
618
Abstract
The choice of database layout, i.e., how database objects such as tables and indexes are assigned to disk drives can significantly impact the I/O performance of the system. Today, DBAs typically rely on fully striping objects across all available disk drives as the basic mechanism for optimizing I/O performance. While full striping maximizes I/O parallelism, when query execution involves coaccess of two or more large objects, e.g., a merge join of two tables, the above strategy may be suboptimal due to the increased number of random I/O accesses on each disk drive. Here, we propose a framework for automating the choice of database layout for a given database that also takes into account the effects of coaccessed objects in the workload faced by the system. We formulate the above as an optimization problem and present an efficient solution to the problem that judiciously takes into account the trade-off between I/O parallelism and random I/O accesses. Our experiments on Microsoft SQL server show the superior I/O performance of our techniques compared to the traditional approach of fully striping each database object across all disk drives.
Keywords
SQL; disc drives; query processing; relational databases; DBA; I/O parallelism; I/O performance optimization; Microsoft SQL server; database index; database layout; database table; disk drive; query execution; random I/O access; relational database; Decision support systems; Disk drives; Indexes; Information retrieval; Relational databases; Time measurement;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2003. Proceedings. 19th International Conference on
Print_ISBN
0-7803-7665-X
Type
conf
DOI
10.1109/ICDE.2003.1260825
Filename
1260825
Link To Document