DocumentCode :
343447
Title :
Multidimensional indexing and query coordination for tertiary storage management
Author :
Shoshani, A. ; Bernardo, L.M. ; Nordberg, H. ; Rotem, D. ; Sim, A.
Author_Institution :
Nat. Energy Res. Sci. Comput. Div., Lawrence Berkeley Lab., CA, USA
fYear :
1999
fDate :
36373
Firstpage :
214
Lastpage :
225
Abstract :
In many scientific domains, experimental devices or simulation programs generate large volumes of data. The volumes of data may reach hundreds of terabytes and therefore it is impractical to store them on disk systems. Rather they are stored on robotic tape systems that are managed by some mass storage system (MSS). A major bottleneck in analyzing the simulated/collected data is the retrieval of subsets from the tertiary storage system. We describe the architecture and implementation of a Storage Access Coordination System (STACS) designed to optimize the use of a disk cache, and thus minimize the number of files read from tape. We achieve this by using a specialized index to locate the relevant data on tapes, and by coordinating file caching over multiple queries. We focus on a specific application area, a high energy physics data management and analysis environment. STACS was implemented and is being incorporated in an operational system, scheduled to go online at the end of 1999. We also include the results of various tests that demonstrate the benefits and efficiency gained of using the STACS
Keywords :
cache storage; database indexing; physics computing; query processing; scientific information systems; storage management; STACS; Storage Access Coordination System; disk cache; disk systems; experimental devices; file caching; high energy physics data management; mass storage system; multidimensional indexing; multiple queries; operational system; query coordination; robotic tape systems; scientific domains; simulation programs; specialized index; subset retrieval; tertiary storage management; tertiary storage system; Analytical models; Cache storage; Data analysis; Design optimization; Energy management; Environmental management; Indexing; Information retrieval; Multidimensional systems; Robot kinematics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Scientific and Statistical Database Management, 1999. Eleventh International Conference on
Conference_Location :
Cleveland, OH
Print_ISBN :
0-7695-0046-3
Type :
conf
DOI :
10.1109/SSDM.1999.787637
Filename :
787637
Link To Document :
بازگشت