Title :
Wide-area Nile: a case study of a wide-area data-parallel application
Author :
Amoroso, Alessandro ; Marzullo, Keith ; Ricciardi, Aleta
Author_Institution :
Dept. de Sci. dell´´Inf., Bologna Univ., Italy
Abstract :
The Nile system is a distributed environment for running very large, data-intensive applications across a network of commodity workstations. These applications process data from elementary particle collisions, generated by the Cornell Electron Storage Ring, and are used by physicists of the CLEO experiment. The applications have a simple data-parallel structure, and so Nile executes them using as much parallelism as is available. Nile currently runs at any single site. It is being used by alpha testers and is scheduled for beta release in March 1998. We describe how we are adapting this local-area Nile system to allow for wide-area, multiple site interactions. In particular, we consider the two problems of scaling and of fault tolerance
Keywords :
fault tolerant computing; high energy physics instrumentation computing; local area networks; parallel processing; wide area networks; CLEO experiment; Cornell Electron Storage Ring; Nile system; case study; data-intensive applications; distributed environment; elementary particle collisions; fault tolerance; local area network; multiple site interactions; wide area data parallel application; wide area network; workstation network; Application software; Collaboration; Computer aided software engineering; Detectors; Electron beams; Fault tolerance; Physics; Processor scheduling; Resource management; Testing;
Conference_Titel :
Distributed Computing Systems, 1998. Proceedings. 18th International Conference on
Conference_Location :
Amsterdam
Print_ISBN :
0-8186-8292-2
DOI :
10.1109/ICDCS.1998.679794