• DocumentCode
    3280433
  • Title

    Fault tolerant distributed shared memory algorithms

  • Author

    Stumm, Michael ; Zhou, Songnian

  • Author_Institution
    Toronto Univ., Ont., Canada
  • fYear
    1990
  • fDate
    9-13 Dec 1990
  • Firstpage
    719
  • Lastpage
    724
  • Abstract
    Distributed shared memory (DSM) has received increased attention as a mechanism for interprocess communication in loosely-coupled distributed systems because of its perceived advantages over direct use of message passing or remote procedure calls. One problem with most DSM algorithms proposed to date, however, is that they do not tolerate faults. The paper extends four basic DSM algorithms to tolerate single host failures and argues that this degree of fault tolerance is sufficient for most applications. It analyzes the performance behavior of the fault tolerant DSM algorithms and shows that for some algorithms the additional overhead for fault tolerance is quite small, but that for other algorithms the extra overhead can be substantial and even unpredictable
  • Keywords
    distributed processing; fault tolerant computing; performance evaluation; distributed shared memory algorithms; fault tolerance; interprocess communication; loosely-coupled distributed systems; performance behavior; single host failures; Algorithm design and analysis; Application software; Clustering algorithms; Costs; Fault tolerance; Fault tolerant systems; Message passing; Performance analysis; Read-write memory; Workstations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing, 1990. Proceedings of the Second IEEE Symposium on
  • Conference_Location
    Dallas, TX
  • Print_ISBN
    0-8186-2087-0
  • Type

    conf

  • DOI
    10.1109/SPDP.1990.143633
  • Filename
    143633