Title :
A general theory for deadlock avoidance in wormhole-routed networks
Author :
Fleury, Eric ; Fraigniaud, Pierre
Author_Institution :
LORIA, Villers-Les-Nancy, France
fDate :
7/1/1998 12:00:00 AM
Abstract :
Most machines of the last generation of distributed memory parallel computers possess specific routers which are used to exchange messages between nonneighboring nodes in the network. Among the several technologies, wormhole routing is usually preferred because it allows low channel-setup time and reduces the dependency between latency and internode distance. However, wormhole routing is very susceptible to deadlock because messages are allowed to hold many resources while requesting others. Therefore, designing deadlock-free routing algorithms using few hardware facilities is a major problem for wormhole-routed networks. In this paper, we describe a general theoretical framework for the study of deadlock-free routing functions. We give a general definition of what can be a routing function. This definition captures many specific definitions of the literature (e.g., vertex dependent, input-dependent, source-dependent, path-dependent etc.). Using our definition, we give a necessary and sufficient condition which characterizes deadlock-free routing functions. Our theory embraces, at a high level, most of the theories related to deadlock avoidance in wormhole-routed networks previously derived in the literature. In particular, it applies not only to one-to-one routing, but also to one-to-many routing. The latter paradigm is used to solve the multicast problem with the path-based or tree-based facility
Keywords :
concurrency control; multiprocessor interconnection networks; system recovery; deadlock avoidance; distributed memory parallel computers; internode distance; multicast problem; necessary and sufficient condition; one-to-many routing; one-to-one routing; routers; tree-based facility; wormhole-routed networks; Algorithm design and analysis; Computer Society; Computer networks; Concurrent computing; Delay; Hardware; Intelligent networks; Routing; Sufficient conditions; System recovery;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on