DocumentCode :
3517741
Title :
Efficient range query retrieval for non-uniform data distributions
Author :
Mohammed, Salahadin ; Harris, Evan P. ; Ramamohanarao, Kotagiri
Author_Institution :
Dept. of Comput. Technol., Monash Univ., Clayton, Vic., Australia
fYear :
2000
fDate :
2000
Firstpage :
90
Lastpage :
98
Abstract :
Answering range queries is a common database operation. Methods based on hashing techniques to minimise the cost of answering range queries by taking the query distribution into account have previously been proposed. These methods have all assumed a uniform distribution of data to disk pages to achieve good performance. This assumption makes them less useful in practice because most real data distributions are non-uniform. In this paper, we discuss a method to eliminate this restriction. Extensive experimentation using a multi-dimensional file structure, the BANG file, indicates that our method results in good performance for all data distributions. In one case, an improvement of over 36 times was achieved without compromising the storage utilisation. Our method also results in a stable and efficient file organisation. If the query distribution does not change substantially, an optimised file organisation rarely requires reorganisation.
Keywords :
database management systems; query processing; data distributions; file organisation; multi-dimensional file structure; query distribution; query retrieval; range queries; Australia; Computer science; Cost function; Delay; Electronic switching systems; Information retrieval; Interpolation; Multidimensional systems; Software engineering; Testing;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Database Conference, 2000. ADC 2000. Proceedings. 11th Australasian
Conference_Location :
Canberra, ACT
Print_ISBN :
0-7695-0528-7
Type :
conf
DOI :
10.1109/ADC.2000.819818
Filename :
819818