• DocumentCode
    48984
  • Title

    Computing Spatial Distance Histograms for Large Scientific Data Sets On-the-Fly

  • Author

    Kumar, Ajit ; Grupcev, Vladimir ; Yongke Yuan ; Jin Huang ; Yi-Cheng Tu ; Gang Shen

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of South Florida, Tampa, FL, USA
  • Volume
    26
  • Issue
    10
  • fYear
    2014
  • fDate
    Oct. 2014
  • Firstpage
    2410
  • Lastpage
    2424
  • Abstract
    This paper focuses on an important query in scientific simulation data analysis: the Spatial Distance Histogram (SDH). The computation time of an SDH query using brute force method is quadratic. Often, such queries are executed continuously over certain time periods, increasing the computation time. We propose highly efficient approximate algorithm to compute SDH over consecutive time periods with provable error bounds. The key idea of our algorithm is to derive statistical distribution of distances from the spatial and temporal characteristics of particles. Upon organizing the data into a Quad-tree based structure, the spatiotemporal characteristics of particles in each node of the tree are acquired to determine the particles´ spatial distribution as well as their temporal locality in consecutive time periods. We report our efforts in implementing and optimizing the above algorithm in graphics processing units (GPUs) as means to further improve the efficiency. The accuracy and efficiency of the proposed algorithm is backed by mathematical analysis and results of extensive experiments using data generated from real simulation studies.
  • Keywords
    data analysis; quadtrees; query processing; statistical distributions; GPUs; SDH query; approximate algorithm; brute force method; consecutive time periods; graphics processing units; large scientific data set on-the-fly; mathematical analysis; particle spatial distribution; provable error bounds; quadtree based structure; scientific simulation data analysis; spatial distance histograms; spatiotemporal particle characteristics; statistical distribution; temporal locality; Algorithm design and analysis; Approximation algorithms; Computational modeling; Distribution functions; Graphical models; Graphics processing units; Synchronous digital hierarchy; GPU; Scientific databases; density map; quad-tree; spatial distance histogram; spatiotemporal locality;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2014.2298015
  • Filename
    6702476