Title :
Reliability and Scheduling on Systems Subject to Failures
Author :
Hakem, Mourad ; Butelle, Franck
Author_Institution :
LIPN - CNRS UMR 7030, Univ. Paris Nord, Villetaneuse
Abstract :
This paper presents a new bi-objective greedy heuristic for scheduling parallel applications on heterogeneous distributed computing systems. The proposed algorithm which is called BSA (bi-objective scheduling algorithm) takes into account not only the time makespan but also the failure probability of the application. Since it is not usually possible to achieve the two conflicting objectives (performance and reliability) simultaneously, a bi-objective compromise function is introduced. BSA has a low time complexity of O(eP + nu log omega), where e and nu are respectively the number of edges and tasks in the task graph of the application. P is the number of machines (processors) in the system and omega is the width of the task graph. Experimental results show the performance of the proposed algorithm.
Keywords :
computational complexity; graph theory; greedy algorithms; parallel algorithms; probability; processor scheduling; software reliability; biobjective compromise function; biobjective greedy heuristic algorithm; biobjective scheduling algorithm; failure probability; heterogeneous distributed computing system; parallel system scheduling; system reliability; task graph; time complexity; Clustering algorithms; Computer applications; Concurrent computing; Cost function; Delay; Distributed computing; Parallel processing; Processor scheduling; Reliability; Scheduling algorithm; DAGs; clustering; computing; directed acyclic graphs; distributed; heterogeneous systems.; multicriteria scheduling; precedence task graphs; reliability; scheduling;
Conference_Titel :
Parallel Processing, 2007. ICPP 2007. International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-0-7695-2933-2
DOI :
10.1109/ICPP.2007.72