DocumentCode :
1038898
Title :
Using processor affinity in loop scheduling on shared-memory multiprocessors
Author :
Markatos, Evangelos P. ; LeBlanc, Thomas J.
Author_Institution :
Dept. of Comput. Sci., Rochester Univ., NY, USA
Volume :
5
Issue :
4
fYear :
1994
fDate :
4/1/1994 12:00:00 AM
Firstpage :
379
Lastpage :
400
Abstract :
Loops are the single largest source of parallelism in many applications. One way to exploit this parallelism is to execute loop iterations in parallel on different processors. Previous approaches to loop scheduling attempted to achieve the minimum completion time by distributing the workload as evenly as possible while minimizing the number of synchronization operations required. The authors consider a third dimension to the problem of loop scheduling on shared-memory multiprocessors: communication overhead caused by accesses to nonlocal data. They show that traditional algorithms for loop scheduling, which ignore the location of data when assigning iterations to processors, incur a significant performance penalty on modern shared-memory multiprocessors. They propose a new loop scheduling algorithm that attempts to simultaneously balance the workload, minimize synchronization, and co-locate loop iterations with the necessary data. They compare the performance of this new algorithm to other known algorithms by using five representative kernel programs on a Silicon Graphics multiprocessor workstation, a BBN Butterfly, a Sequent Symmetry, and a KSR-1, and show that the new algorithm offers substantial performance improvements, up to a factor of 4 in some cases. The authors conclude that loop scheduling algorithms for shared-memory multiprocessors cannot afford to ignore the location of data, particularly in light of the increasing disparity between processor and memory speeds
Keywords :
performance evaluation; scheduling; shared memory systems; BBN Butterfly; KSR-1; Sequent Symmetry; Silicon Graphics multiprocessor; communication overhead; iterations; kernel programs; load imbalance; loop iterations; loop scheduling; performance improvements; processor affinity; shared-memory multiprocessors; synchronization; Computer science; Dynamic scheduling; Graphics; Kernel; Load management; Parallel processing; Processor scheduling; Runtime; Scheduling algorithm; Silicon;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/71.273046
Filename :
273046
Link To Document :
بازگشت