DocumentCode
692892
Title
SDQuery DSI: Integrating data management support with a wide area data transfer protocol
Author
Yu Su ; Yi Wang ; Agrawal, Gagan ; Kettimuthu, Rajkumar
Author_Institution
Comput. Sci. & Eng, Ohio State Univ., Columbus, OH, USA
fYear
2013
fDate
17-22 Nov. 2013
Firstpage
1
Lastpage
12
Abstract
In many science areas where datasets need to be transferred or shared, rapid growth in dataset size, coupled with much slower increases in wide area data transfer bandwidths, is making it extremely hard for scientists to analyze the data. This paper addresses the current limitations by developing SDQuery DSI, a GridFTP plug-in that supports flexible server-side data subsetting. An existing GridFTP server is able to dynamically load this tool to support new functionality. Different queries types (query over dimensions, coordinates and values) are supported by our tool. A number of optimizations, like parallel indexing, performance model for data subsetting, and parallel streaming are also applied. We compare our SDQuery DSI with GridFTP default File DSI in different network environments, and show that our method can achieve better efficiency in almost all cases.
Keywords
database indexing; grid computing; query processing; wide area networks; GridFTP server; SDQuery DSI; data management support; data subsetting; flexible server-side data subsetting; parallel indexing; parallel streaming; performance model; wide area data transfer protocol; Abstracts; Filtering; Indexing; Photonics; Ports (Computers); Protocols; Security; I/O performance tuning; data management; indexing; query processing; wide area networks;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis (SC), 2013 International Conference for
Conference_Location
Denver, CO
Print_ISBN
978-1-4503-2378-9
Type
conf
Filename
6877480
Link To Document