Title :
I/O streaming evaluation of batch queries for data-intensive computational turbulence
Author :
Kanov, Kalin ; Perlman, Eric ; Burns, Randal ; Ahmad, Yanif ; Szalay, Alexander
Author_Institution :
Dept. of Comput. Sci., Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
We describe a method for evaluating computational turbulence queries, including Lagrange Polynomial interpolation, based on partial sums that allows the underlying data to be accessed in any order and in parts. We exploit these properties to stream data from disk in a single pass and concurrently evaluate batch queries. The combination of sequential I/O and data sharing improves performance by an order of magnitude when compared with direct evaluation of each query. The technique also supports distributed evaluation of queries in a database cluster, assembling the partial sums from each node at the query mediator. Interpolation is fundamental to computational turbulence, over 95% of queries use these routines, and the partial sums method al- lows the JHU Turbulence Database Cluster to realize scale and throughput for our scientists´ data-intensive workloads.
Keywords :
database management systems; input-output programs; interpolation; polynomial approximation; query processing; I/O streaming evaluation; JHU turbulence; Lagrange polynomial interpolation; batch queries; data intensive computational turbulence; data intensive workloads; database cluster; query mediator; Distributed databases; Interpolation; Kernel; Polynomials; Sorting; Vectors; Data-intensive computing; I/O streaming; database clusters; query evaluation; query optimization; software for high-throughput computing;
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis (SC), 2011 International Conference for
Conference_Location :
Seatle, WA
Electronic_ISBN :
978-1-4503-0771-0