DocumentCode
3122203
Title
Feedback-controlled resource sharing for predictable eScience
Author
Park, Sang-Min ; Humphrey, Marty
Author_Institution
Dept. of Comput. Sci., Univ. of Virginia, Charlottesville, VA, USA
fYear
2008
fDate
15-21 Nov. 2008
Firstpage
1
Lastpage
12
Abstract
The emerging class of adaptive, real-time, data-driven applications is a significant problem for today´s HPC systems. In general, it is extremely difficult for queuing-system-controlled HPC resources to make and guarantee a tightly-bounded prediction regarding the time at which a newly-submitted application will execute. While a reservation-based approach partially addresses the problem, it can create severe resource under-utilization (unused reservations, necessary scheduled idle slots, underutilized reservations, etc.) that resource providers are eager to avoid. In contrast, this paper presents a fundamentally different approach to guarantee predictable execution. By creating a virtualized application layer called the performance container, and opportunistically multiplexing concurrent performance containers through the application of formal feedback control theory, we regulate the job´s progress such that the job meets its deadline without requiring exclusive access to resources even in the presence of a wide class of unexpected disturbances. Our evaluation using two widely-used applications, WRF and BLAST, on an 8-core server show our approach is predictable and meets deadlines with 3.4 % of errors on average while achieving high overall utilization.
Keywords
feedback; queueing theory; scientific information systems; feedback-control theory; high performance computing; opportunistically multiplexing concurrent performance container; predictable eScience; queuing-system-controlled HPC resource; reservation-based approach; resource sharing; tightly-bounded prediction; Application software; Application virtualization; Computer science; Containers; Physics computing; Real time systems; Resource management; Resource virtualization; Supercomputers; Weather forecasting;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis, 2008. SC 2008. International Conference for
Conference_Location
Austin, TX
Print_ISBN
978-1-4244-2834-2
Electronic_ISBN
978-1-4244-2835-9
Type
conf
DOI
10.1109/SC.2008.5217786
Filename
5217786
Link To Document