Title :
Autonomous protocols for bandwidth-centric scheduling of independent-task applications
Author :
Kreaseck, Barbara ; Carter, Larry ; Casanova, Henri ; Ferrante, Jeanne
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of California, San Diego, CA, USA
Abstract :
In this paper we investigate protocols for scheduling applications that consist of large numbers of identical, independent tasks on large-scale computing platforms. By imposing a tree structure on an overlay network of computing nodes, our previous work showed that it is possible to compute the schedule which leads to the optimal steady-state task completion rate. However, implementing this optimal schedule in practice, without prohibitive global coordination of all the computing nodes or unlimited buffers, remained an open question. To address this question, in this paper we develop autonomous scheduling protocols, i.e. distributed scheduling algorithms by which each node makes scheduling decisions based solely on locally available information. Our protocols have two variants: with non-interruptible and with interruptible communications. Further, we evaluate both protocols using simulations on randomly generated trees. We show that the non-interruptible communication version may need a prohibitive number of buffers at each node. However, our autonomous protocol with interruptible communication and only 3 buffers per node reaches the optimal steady-state performance in over 99.5% of our simulations. The autonomous scheduling approach is inherently scalable and adaptable, and thus ideally suited to currently emerging computing platforms. In particular this work has direct impact on the deployment of large applications on Grid, and peer-to-peer computing platforms.
Keywords :
distributed algorithms; grid computing; performance evaluation; processor scheduling; protocols; trees (mathematics); Grid computing; autonomous protocols; bandwidth-centric scheduling; distributed scheduling algorithms; identical independent tasks; independent-task applications; interruptible communication; large-scale computing platforms; optimal steady-state performance; peer-to-peer computing; randomly generated trees; simulations; Computational modeling; Computer networks; Large-scale systems; Optimal scheduling; Peer to peer computing; Processor scheduling; Protocols; Scheduling algorithm; Steady-state; Tree data structures;
Conference_Titel :
Parallel and Distributed Processing Symposium, 2003. Proceedings. International
Print_ISBN :
0-7695-1926-1
DOI :
10.1109/IPDPS.2003.1213104