• DocumentCode
    3375283
  • Title

    An introduction to fault tolerant parallel simulation with EcliPSe

  • Author

    Knop, Felipe ; Mascarenhas, Edward ; Rego, Vernon ; Sunderam, V.S.

  • Author_Institution
    Dept. of Comput. Sci., Purdue Univ., West Lafayette, IN, USA
  • fYear
    1994
  • fDate
    11-14 Dec. 1994
  • Firstpage
    700
  • Lastpage
    707
  • Abstract
    The paper presents an overview of the ACES parallel software system and, in particular, an introduction to the EcliPSe layer of the system. The ACES system is a fault tolerant, layered software system for heterogeneous network based cluster computing. The EcliPSe toolkit, which resides on an upper layer, was constructed specifically for replication based and domain decomposition based simulation applications. It is not, however, restricted to simulations and supports any message passing form of parallel processing. By taking advantage of networks of heterogeneous machines, generally "idle" workstations, EcliPSe programs can achieve supercomputer level performance with little programming effort-that is, low programming effort was a motivating factor in EcliPSe\´s design. We present an overview of key application level features in EcliPSe, support for fault tolerant simulation, and performance results for three simple but large scale and representative experiments.
  • Keywords
    digital simulation; message passing; parallel programming; software fault tolerance; ACES parallel software system; EcliPSe; domain decomposition based simulation applications; fault tolerant parallel simulation; heterogeneous machine; heterogeneous network based cluster computing; layered software system; message passing form; supercomputer level performance; Application software; Computational modeling; Computer networks; Fault tolerance; Fault tolerant systems; Message passing; Parallel processing; Software systems; Supercomputers; Workstations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Simulation Conference Proceedings, 1994. Winter
  • Print_ISBN
    0-7803-2109-X
  • Type

    conf

  • DOI
    10.1109/WSC.1994.717416
  • Filename
    717416