DocumentCode :
2503646
Title :
Data Sharing Pattern Aware Scheduling on Grids
Author :
Lee, Young Choon ; Zomaya, Albert Y.
Author_Institution :
Sch. of Inf. Technol., Sydney Univ., NSW
fYear :
2006
fDate :
14-18 Aug. 2006
Firstpage :
365
Lastpage :
372
Abstract :
These days an increasing number of applications, especially in science and engineering, are dealing with a massive amount of data; hence they are data-intensive. Bioinformatics, data-mining and image processing are some typical areas of data-intensive applications. Such applications tend to be deployed on grids that provide powerful processing capabilities at reasonable cost. One fundamental scheduling issue, that arises when exploiting grids with these types of applications, is the minimization of data transfer. Therefore, the use of an efficient scheduling scheme that takes into account data transfers is rather essential in order to achieve both a shorter application completion time and efficient system utilization. In this paper, a novel scheduling algorithm, called the shared input data based listing (SIL) algorithm for data-intensive bag-of-tasks (DBoT) applications in grid environments is proposed. The algorithm uses a set of task lists that are constructed taking the data sharing pattern into account and that are reorganized dynamically, based on performance of resources, during the execution of the application. The primary goal of this dynamic listing is to minimize data transfer, thus leading to shortening the overall completion time of DBoT applications. SIL further attempts to reduce serious schedule increases by adopting task duplication. In our evaluation study extensive simulation tests with three different types of the DBoT application model have been conducted. Based on the experimental results, SIL noticeably outperforms two previously proposed algorithms in schedule length
Keywords :
electronic data interchange; grid computing; scheduling; bioinformatics; data mining; data sharing pattern aware scheduling; data transfer minimization; data-intensive bag-of-task application; grid computing; image processing; scheduling algorithm; shared input data based listing algorithm; Australia; Bioinformatics; Data engineering; Data mining; Distributed computing; Image processing; Information technology; Power engineering and energy; Processor scheduling; Scheduling algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Processing, 2006. ICPP 2006. International Conference on
Conference_Location :
Columbus, OH
ISSN :
0190-3918
Print_ISBN :
0-7695-2636-5
Type :
conf
DOI :
10.1109/ICPP.2006.30
Filename :
1690639
Link To Document :
بازگشت