DocumentCode :
1203297
Title :
A combined method for maintaining large indices in multiprocessor multidisk environments
Author :
Matsliach, Gabriel ; Shmueli, Oded
Author_Institution :
Dept. of Comput. Sci., Technion-Israel Inst. of Technol., Haifa, Israel
Volume :
6
Issue :
3
fYear :
1994
fDate :
6/1/1994
Firstpage :
479
Lastpage :
496
Abstract :
Consider the problem of maintaining large indices (or secondary memory indices) in a multiprocessor multidisk environment in which each processor has a dedicated secondary memory (one disk or more). The processors either reside in the same site and communicate via shared memory, or reside in different sites and communicate via a local broadcast network. The straightforward method (SFM) for maintaining such an index, which is commonly called declustering, is to partition the index records equally among the processors, each of which maintains its part of the index in a local B+-tree. In prior work (Inform. Processing Lett., vol. 34, pp. 313-321, May 1990), we have presented another method, called the “totally distributed B+-tree” (TDB) method, in which all processors together implement a “wide” B+-tree. There are settings in which the second method is better than the first method, and vice versa. In this paper, we present a new method, called the combined distribution method (CDM), that combines the ideas underlying SFM and TDB. In tightly coupled environments, CDM outperforms both SFM and TDB in almost all practical settings (in many settings by more than 30%). This is shown by an approximate analysis and verified by simulations. Note that CDM's approach can improve performance in database systems that use a RAID (redundant array of inexpensive disks).
Keywords :
distributed databases; indexing; magnetic disc storage; multiprocessing systems; performance evaluation; tree data structures; RAID; approximate analysis; combined distribution method; data structures; database systems; declustering; dedicated secondary memory; distributed file systems; distributed indices; index record partitioning; large index maintenance; local B+-tree; local broadcast network; multiprocessor multidisk environments; performance; redundant array of inexpensive disks; secondary memory indices; shared memory; simulation; straightforward method; tightly coupled environments; totally distributed B+-tree method; Analytical models; Broadcasting; Communication networks; Computer science; Costs; Data structures; Database machines; Database systems; File systems; Modems;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
IEEE
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/69.334867
Filename :
334867