Title :
Approximate Algorithms for Computing Spatial Distance Histograms with Accuracy Guarantees
Author :
Grupcev, Vladimir ; Yongke Yuan ; Yi-Cheng Tu ; Jin Huang ; Shaoping Chen ; Pandit, Shubha ; Weng, Ming-Fang
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of South Florida, Tampa, FL, USA
Abstract :
Particle simulation has become an important research tool in many scientific and engineering fields. Data generated by such simulations impose great challenges to database storage and query processing. One of the queries against particle simulation data, the spatial distance histogram (SDH) query, is the building block of many high-level analytics, and requires quadratic time to compute using a straightforward algorithm. Previous work has developed efficient algorithms that compute exact SDHs. While beating the naive solution, such algorithms are still not practical in processing SDH queries against large-scale simulation data. In this paper, we take a different path to tackle this problem by focusing on approximate algorithms with provable error bounds. We first present a solution derived from the aforementioned exact SDH algorithm, and this solution has running time that is unrelated to the system size N. We also develop a mathematical model to analyze the mechanism that leads to errors in the basic approximate algorithm. Our model provides insights on how the algorithm can be improved to achieve higher accuracy and efficiency. Such insights give rise to a new approximate algorithm with improved time/accuracy tradeoff. Experimental results confirm our analysis.
Keywords :
approximation theory; mathematical analysis; query processing; SDH algorithm; SDH query; accuracy guarantees; approximate algorithm; computing spatial distance histogram; database storage; engineering field; high-level analytics; large-scale simulation data; mathematical model; particle simulation data; provable error bound; quadratic time; query processing; scientific field; spatial distance histogram query; time/accuracy tradeoff; Accuracy; Algorithm design and analysis; Approximation algorithms; Computational modeling; Data models; Histograms; Synchronous digital hierarchy; Molecular simulation; quadtree; scientific databases; spatial distance histogram;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2012.149