• DocumentCode
    3122203
  • Title

    Feedback-controlled resource sharing for predictable eScience

  • Author

    Park, Sang-Min ; Humphrey, Marty

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Virginia, Charlottesville, VA, USA
  • fYear
    2008
  • fDate
    15-21 Nov. 2008
  • Firstpage
    1
  • Lastpage
    12
  • Abstract
    The emerging class of adaptive, real-time, data-driven applications is a significant problem for today´s HPC systems. In general, it is extremely difficult for queuing-system-controlled HPC resources to make and guarantee a tightly-bounded prediction regarding the time at which a newly-submitted application will execute. While a reservation-based approach partially addresses the problem, it can create severe resource under-utilization (unused reservations, necessary scheduled idle slots, underutilized reservations, etc.) that resource providers are eager to avoid. In contrast, this paper presents a fundamentally different approach to guarantee predictable execution. By creating a virtualized application layer called the performance container, and opportunistically multiplexing concurrent performance containers through the application of formal feedback control theory, we regulate the job´s progress such that the job meets its deadline without requiring exclusive access to resources even in the presence of a wide class of unexpected disturbances. Our evaluation using two widely-used applications, WRF and BLAST, on an 8-core server show our approach is predictable and meets deadlines with 3.4 % of errors on average while achieving high overall utilization.
  • Keywords
    feedback; queueing theory; scientific information systems; feedback-control theory; high performance computing; opportunistically multiplexing concurrent performance container; predictable eScience; queuing-system-controlled HPC resource; reservation-based approach; resource sharing; tightly-bounded prediction; Application software; Application virtualization; Computer science; Containers; Physics computing; Real time systems; Resource management; Resource virtualization; Supercomputers; Weather forecasting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis, 2008. SC 2008. International Conference for
  • Conference_Location
    Austin, TX
  • Print_ISBN
    978-1-4244-2834-2
  • Electronic_ISBN
    978-1-4244-2835-9
  • Type

    conf

  • DOI
    10.1109/SC.2008.5217786
  • Filename
    5217786