Title :
Matchmaking: distributed resource management for high throughput computing
Author :
Raman, Rajesh ; Livny, Miron ; Solomon, Marvin
Author_Institution :
Wisconsin Univ., Madison, WI, USA
Abstract :
Conventional resource management systems use a system model to describe resources and a centralized scheduler to control their allocation. We argue that this paradigm does not adapt well to distributed systems, particularly those built to support high throughput computing. Obstacles include heterogeneity of resources, which make uniform allocation algorithms difficult to formulate, and distributed ownership, leading to widely varying allocation policies. Faced with these problems, we developed and implemented the classified advertisement (classad) matchmaking framework, a flexible and general approach to resource management in distributed environment with decentralized ownership of resources. Novel aspects of the framework include a semi structured data model that combines schema, data, and query in a simple but powerful specification language, and a clean separation of the matching and claiming phases of resource allocation. The representation and protocols result in a robust, scalable and flexible framework that can evolve with changing resources. The framework was designed to solve real problems encountered in the deployment of Condor, a high throughput computing system developed at the University of Wisconsin-Madison. Condor is heavily used by scientists at numerous sites around the world. It derives much of its robustness and efficiency from the matchmaking architecture
Keywords :
data structures; distributed processing; resource allocation; scheduling; specification languages; Condor; centralized scheduler; classad matchmaking framework; classified advertisement matchmaking framework; decentralized ownership; distributed environment; distributed ownership; distributed resource management; flexible framework; high throughput computing; high throughput computing system; matchmaking architecture; resource management systems; semi structured data model; specification language; system model; Centralized control; Computer architecture; Data models; Distributed computing; Processor scheduling; Protocols; Resource management; Robustness; Specification languages; Throughput;
Conference_Titel :
High Performance Distributed Computing, 1998. Proceedings. The Seventh International Symposium on
Conference_Location :
Chicago, IL
Print_ISBN :
0-8186-8579-4
DOI :
10.1109/HPDC.1998.709966