DocumentCode
27465
Title
Approximate Algorithms for Computing Spatial Distance Histograms with Accuracy Guarantees
Author
Grupcev, Vladimir ; Yongke Yuan ; Yi-Cheng Tu ; Jin Huang ; Shaoping Chen ; Pandit, Shubha ; Weng, Ming-Fang
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. of South Florida, Tampa, FL, USA
Volume
25
Issue
9
fYear
2013
fDate
Sept. 2013
Firstpage
1982
Lastpage
1996
Abstract
Particle simulation has become an important research tool in many scientific and engineering fields. Data generated by such simulations impose great challenges to database storage and query processing. One of the queries against particle simulation data, the spatial distance histogram (SDH) query, is the building block of many high-level analytics, and requires quadratic time to compute using a straightforward algorithm. Previous work has developed efficient algorithms that compute exact SDHs. While beating the naive solution, such algorithms are still not practical in processing SDH queries against large-scale simulation data. In this paper, we take a different path to tackle this problem by focusing on approximate algorithms with provable error bounds. We first present a solution derived from the aforementioned exact SDH algorithm, and this solution has running time that is unrelated to the system size N. We also develop a mathematical model to analyze the mechanism that leads to errors in the basic approximate algorithm. Our model provides insights on how the algorithm can be improved to achieve higher accuracy and efficiency. Such insights give rise to a new approximate algorithm with improved time/accuracy tradeoff. Experimental results confirm our analysis.
Keywords
approximation theory; mathematical analysis; query processing; SDH algorithm; SDH query; accuracy guarantees; approximate algorithm; computing spatial distance histogram; database storage; engineering field; high-level analytics; large-scale simulation data; mathematical model; particle simulation data; provable error bound; quadratic time; query processing; scientific field; spatial distance histogram query; time/accuracy tradeoff; Accuracy; Algorithm design and analysis; Approximation algorithms; Computational modeling; Data models; Histograms; Synchronous digital hierarchy; Molecular simulation; quadtree; scientific databases; spatial distance histogram;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2012.149
Filename
6249685
Link To Document