Title :
Scalability analysis of declustering methods for multidimensional range queries
Author :
Moon, Bongki ; Saltz, Joel H.
Author_Institution :
Dept. of Comput. Sci., Maryland Univ., College Park, MD, USA
Abstract :
Efficient storage and retrieval of multi-attribute data sets has become one of the essential requirements for many data-intensive applications. The Cartesian product file has been known as an effective multi-attribute file structure for partial-match and best-match queries. Several heuristic methods have been developed to decluster Cartesian product files across multiple disks to obtain high performance for disk accesses. Although the scalability of the declustering methods becomes increasingly important for systems equipped with a large number of disks, no analytic studies have been done so far. The authors derive formulas describing the scalability of two popular declustering methods-Disk Module and Fieldwise Xor-for range queries, which are the most common type of queries. These formulas disclose the limited scalability of the declustering methods, and this is corroborated by extensive simulation experiments. From the practical point of view, the formulas given in the paper provide a simple measure that can be used to predict the response time of a given range query and to guide the selection of a declustering method under various conditions
Keywords :
database management systems; file organisation; query processing; Cartesian product file; Disk Module; Fieldwise Xor; best-match queries; data-intensive applications; declustering methods; disk accesses; heuristic methods; multi-attribute data set retrieval; multi-attribute data set storage; multi-attribute file structure; multidimensional range queries; multiple disks; partial-match queries; response time; scalability analysis; simulation experiments; Databases; Delay; Earth Observing System; Error correction codes; Information retrieval; Large-scale systems; Moon; Multidimensional systems; Scalability; Time measurement;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on