• DocumentCode
    692892
  • Title

    SDQuery DSI: Integrating data management support with a wide area data transfer protocol

  • Author

    Yu Su ; Yi Wang ; Agrawal, Gagan ; Kettimuthu, Rajkumar

  • Author_Institution
    Comput. Sci. & Eng, Ohio State Univ., Columbus, OH, USA
  • fYear
    2013
  • fDate
    17-22 Nov. 2013
  • Firstpage
    1
  • Lastpage
    12
  • Abstract
    In many science areas where datasets need to be transferred or shared, rapid growth in dataset size, coupled with much slower increases in wide area data transfer bandwidths, is making it extremely hard for scientists to analyze the data. This paper addresses the current limitations by developing SDQuery DSI, a GridFTP plug-in that supports flexible server-side data subsetting. An existing GridFTP server is able to dynamically load this tool to support new functionality. Different queries types (query over dimensions, coordinates and values) are supported by our tool. A number of optimizations, like parallel indexing, performance model for data subsetting, and parallel streaming are also applied. We compare our SDQuery DSI with GridFTP default File DSI in different network environments, and show that our method can achieve better efficiency in almost all cases.
  • Keywords
    database indexing; grid computing; query processing; wide area networks; GridFTP server; SDQuery DSI; data management support; data subsetting; flexible server-side data subsetting; parallel indexing; parallel streaming; performance model; wide area data transfer protocol; Abstracts; Filtering; Indexing; Photonics; Ports (Computers); Protocols; Security; I/O performance tuning; data management; indexing; query processing; wide area networks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis (SC), 2013 International Conference for
  • Conference_Location
    Denver, CO
  • Print_ISBN
    978-1-4503-2378-9
  • Type

    conf

  • Filename
    6877480