Title : 
Bulk Scheduling With the DIANA Scheduler
         
        
            Author : 
Anjum, Ashiq ; McClatchey, Richard ; Ali, Arshad ; Willers, Ian
         
        
            Author_Institution : 
CCS Research Centre, University of the West of England, Bristol
         
        
            Abstract : 
Results from the research and development of a Data Intensive and Network Aware (DIANA) scheduling engine, to be used primarily for data-intensive sciences such as physics analysis, are described. In Grid analyses, tasks can involve thousands of computing, data handling, and network resources. The central problem in scheduling these resources is the coordinated management of computation and data at multiple locations, not just data replication or movement. However, this coordination can prove costly, and efficient scheduling becomes a challenge if compute and data resources are mapped without considering network costs. We have implemented an adaptive algorithm within the DIANA Scheduler that takes into account data location and size, network performance, and computation capability in order to enable efficient global scheduling. DIANA is a performance-aware and economy-guided Meta Scheduler. It iteratively allocates each job to the site that is most likely to produce the best performance and optimizes the global queue for any remaining jobs. Therefore, it is equally suitable whether a single job is being submitted or bulk scheduling is being performed. Results indicate that considerable performance improvements can be gained by adopting the DIANA scheduling approach.
         
        
            Keywords : 
data analysis; grid computing; high energy physics instrumentation computing; position sensitive particle detectors; resource allocation; scheduling; CMS data analysis; Compact Muon Solenoid data analysis; DIANA scheduling engine; adaptive algorithm; bulk scheduling; data intensive sciences; economy-guided meta scheduler; grid analysis; job scheduling; physics analysis; Computer networks; Costs; Data handling; Engines; Physics; Processor scheduling; Research and development; Resource management; data-intensive and network-aware (DIANA) scheduler; network-aware scheduling decisions; priority-driven multiqueue feedback algorithm
         
        
            Journal_Title : 
Nuclear Science, IEEE Transactions on
         
        
            DOI : 
10.1109/TNS.2006.886047