DocumentCode
402233
Title
Applying database support for large scale data driven science in distributed environments
Author
Narayanan, Sivaramakrishnan ; Catalyurek, Umit ; Kurc, Tahsin ; Zhang, Xi ; Saltz, Joel
Author_Institution
Dept. of Biomed. Informatics, Ohio State Univ., Columbus, OH, USA
fYear
2003
fDate
17 Nov. 2003
Firstpage
141
Lastpage
148
Abstract
There is a rapidly growing set of applications, referred to as data driven applications, in which analysis of large amounts of data drives the next steps taken by the scientist, e.g., running new simulations, doing additional measurements, extending the analysis to larger data collections. Critical steps in data analysis are to extract the data of interest from large and potentially distributed datasets and to move it from storage clusters to compute clusters for processing. We have developed a middleware framework, called GridDB-Lite, that is designed to efficiently support these two steps. We describe the application of GridDB-Lite in large scale oil reservoir simulation studies and experimentally evaluate several optimizations that can be employed in the GridDB-Lite runtime system.
Keywords
data analysis; distributed databases; grid computing; middleware; query formulation; query processing; relational databases; GridDB-Lite middleware; GridDB-Lite runtime system; data analysis; data extraction; database support; distributed environments; large scale oil reservoir simulation; query formulation; query optimization; storage clusters; Analytical models; Computational modeling; Data analysis; Data mining; Distributed computing; Distributed databases; Hydrocarbon reservoirs; Large-scale systems; Middleware; Petroleum;
fLanguage
English
Publisher
ieee
Conference_Titel
Grid Computing, 2003. Proceedings. Fourth International Workshop on
Print_ISBN
0-7695-2026-X
Type
conf
DOI
10.1109/GRID.2003.1261709
Filename
1261709
Link To Document