Title :
An execution service for a partitionable low bandwidth network
Author :
Hickey, T.M. ; van Renesse, R.
Abstract :
As the amount of scientific data grows to the point where the Internet bandwidth no longer supports its transfer it becomes necessary to make powerful computational services available near data repositories. Such services allow remote researchers to start long-running parallel computations on the data. Current execution services do not provide remote users with adequate management facilities for this style of computing. This paper describes the PEX system. It has an architecture based on partitionable group communication. We describe how PEX maintains replicated state in the face of processor failures and network partitions, and how it allows remote clients to manipulate this state. We present some performance numbers, and close with discussing related work.
Keywords :
Internet; computer network management; performance evaluation; Internet bandwidth; PEX system; data repositories; execution service; management facilities; network partitions; parallel computations; partitionable low bandwidth network; performance numbers; processor failures; Bandwidth; Computer crashes; Computer networks; Concurrent computing; Linear particle accelerator; NASA; Power system management; Satellites;
Conference_Titel :
Fault-Tolerant Computing, 1999. Digest of Papers. Twenty-Ninth Annual International Symposium on
Conference_Location :
Madison, WI, USA
Print_ISBN :
0-7695-0213-X
DOI :
10.1109/FTCS.1999.781048