Title :
SDQuery DSI: Integrating data management support with a wide area data transfer protocol
Author :
Yu Su ; Yi Wang ; Agrawal, Gagan ; Kettimuthu, Rajkumar
Author_Institution :
Comput. Sci. & Eng, Ohio State Univ., Columbus, OH, USA
Abstract :
In many science areas where datasets need to be transferred or shared, rapid growth in dataset size, coupled with much slower increases in wide area data transfer bandwidths, is making it extremely hard for scientists to analyze the data. This paper addresses the current limitations by developing SDQuery DSI, a GridFTP plug-in that supports flexible server-side data subsetting. An existing GridFTP server is able to dynamically load this tool to support new functionality. Different queries types (query over dimensions, coordinates and values) are supported by our tool. A number of optimizations, like parallel indexing, performance model for data subsetting, and parallel streaming are also applied. We compare our SDQuery DSI with GridFTP default File DSI in different network environments, and show that our method can achieve better efficiency in almost all cases.
Keywords :
database indexing; grid computing; query processing; wide area networks; GridFTP server; SDQuery DSI; data management support; data subsetting; flexible server-side data subsetting; parallel indexing; parallel streaming; performance model; wide area data transfer protocol; Abstracts; Filtering; Indexing; Photonics; Ports (Computers); Protocols; Security; I/O performance tuning; data management; indexing; query processing; wide area networks;
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis (SC), 2013 International Conference for
Conference_Location :
Denver, CO
Print_ISBN :
978-1-4503-2378-9