Title :
Architecture-dependent partitioning of dependence graphs
Author :
Beck, M. ; Zehendner, E. ; Ungerer, Th
Author_Institution :
Dept. Math. & Comput. Sci., Friedrich-Schiller-Univ., Jena, Germany
Abstract :
Performance tuning of non-blocking threads is based on graph partitioning algorithms that create serial code block from dependence graphs. Previously existing algorithms are directed toward deadlock-avoidance and maximisation of run-length. The latter criterion often generates a high synchronisation overhead. This paper presents a partitioning algorithm for dependence graphs that uses a heuristic to determine a cost-efficient solution based on an architecture-dependent cost function. We present empirical results based on benchmark programs that were compiled with MIT´s Id compiler, extended by our architecture-dependent partitioning algorithm. The results demonstrate a reduction in software overhead with our architecture-dependent partitioning algorithm, compared with previously existing partitioning methods. The execution of the sample programs on an emulator for the Monsoon dataflow architecture shows a reduced number of processor cycles
Keywords :
parallel architectures; parallel programming; performance evaluation; synchronisation; Id compiler; Monsoon dataflow architecture; architecture-dependent partitioning; benchmark programs; deadlock avoidance; dependence graphs; emulator; graph partitioning algorithms; performance tuning; processor cycles; run-length maximisation; serial code block; software overhead; Computer architecture; Computer science; Cost function; Mathematics; Out of order; Parallel processing; Partitioning algorithms; Software algorithms; System recovery; Yarn;
Conference_Titel :
Parallel and Distributed Processing, 1998. PDP '98. Proceedings of the Sixth Euromicro Workshop on
Conference_Location :
Madrid
Print_ISBN :
0-8186-8332-5
DOI :
10.1109/EMPDP.1998.647182