• DocumentCode
    768064
  • Title

    Software-implemented fault-tolerance and separate recovery strategies enhance maintainability [substation automation]

  • Author

    Deconinck, Geert ; De Florio, Vincenzo ; Botti, Oliver

  • Author_Institution
    Dept. of Electr. Eng., Katholieke Univ., Leuven, Belgium
  • Volume
    51
  • Issue
    2
  • fYear
    2002
  • fDate
    6/1/2002 12:00:00 AM
  • Firstpage
    158
  • Lastpage
    165
  • Abstract
    This paper describes a novel approach to software-implemented fault tolerance for distributed applications. This new approach can be used to enhance the flexibility and maintainability of the target applications in a cost-effective way. This is reached through a framework-approach including: (1) a library of fault tolerance functions; (2) a middleware application coordinating these functions; and (3) a language for the expression of nonfunctional services, including configuration, error recovery and fault injection. This framework-approach increases the availability and reliability of the application at a justifiable cost, also thanks to the re-usability of the components in different target systems. This framework-approach further increases the maintainability due to the separation of the functional behavior from the recovery strategies that are executed when an error is detected, because the modifications to functional and nonfunctional behavior are, to some extent, independent, and hence less complex to deal with. The resulting tool matches well, e.g., with current industrial requirements for embedded distributed systems, calling for adaptable and reusable software components. The "integration of this approach in an automation system of a substation for electricity distribution" reports this experience. This case study shows in particular the ability of the configuration-and-recovery language ARIEL to allow adaptability to changes in the environment. This framework-approach is also useful in the context of distributed automation systems that are interconnected via a nondedicated network
  • Keywords
    distributed control; power system control; software fault tolerance; software maintenance; software reusability; substations; ARIEL configuration-and-recovery language; adaptable reusable software components; availability; electricity distribution; embedded distributed systems; error recovery; fault tolerance functions; middleware; recovery strategies; reliability; software maintainability; software-implemented fault-tolerance; substation automation; Application software; Availability; Computer industry; Costs; Embedded software; Fault tolerance; Maintenance; Middleware; Software libraries; Substation automation;
  • fLanguage
    English
  • Journal_Title
    Reliability, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9529
  • Type

    jour

  • DOI
    10.1109/TR.2002.1011520
  • Filename
    1011520