DocumentCode :
3638971
Title :
JAWS: Job-Aware Workload Scheduling for the Exploration of Turbulence Simulations
Author :
Xiaodan Wang;Eric Perlman;Randal Burns;Tanu Malik;Tamas Budavári;Charles Meneveau;Alexander Szalay
fYear :
2010
Firstpage :
1
Lastpage :
11
Abstract :
We present JAWS, a job-aware, data-driven batch scheduler that improves query throughput for data-intensive scientific database clusters. As datasets reach petabyte-scale, workloads that scan through vast amounts of data to extract features are gaining importance in the sciences. However, acute performance bottlenecks result when multiple queries execute simultaneously and compete for I/O resources. Our solution, JAWS, divides queries into I/O-friendly sub-queries for scheduling. It then identifies overlapping data requirements within the workload and executes sub-queries in batches to maximize data sharing and reduce redundant I/O. JAWS extends our previous work by supporting workflows in which queries exhibit data dependencies, exploiting workload knowledge to coordinate caching decisions, and combating starvation through adaptive and incremental trade-offs between query throughput and response time. Instrumenting JAWS in the Turbulence Database Cluster yields nearly three-fold improvement in query throughput when contention in the workload is high.
Keywords :
"Throughput","Databases","Time factors","Heuristic algorithms","Astronomy","Instruments","Measurement"
Publisher :
ieee
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis (SC), 2010 International Conference for
Print_ISBN :
978-1-4244-7557-5
Type :
conf
DOI :
10.1109/SC.2010.31
Filename :
5644888
Link To Document :
بازگشت