• DocumentCode
    1810127
  • Title

    Multithreading-Enabled Active Replication for Event Stream Processing Operators

  • Author

    Brito, Andrey ; Fetzer, Christof ; Felber, Pascal

  • Author_Institution
    Syst. Eng. Group, Tech. Univ. Dresden, Dresden, Germany
  • fYear
    2009
  • fDate
    27-30 Sept. 2009
  • Firstpage
    22
  • Lastpage
    31
  • Abstract
    Event stream processing (ESP) systems are very popular in monitoring applications. Algorithmic trading, network monitoring and sensor networks are good examples of applications that rely upon ESP systems. As these systems become larger and more widely deployed, they have to answer increasingly stronger requirements that are often difficult to satisfy. Fault-tolerance is a good example of such a non-trivial requirement. Making ESP operators fault-tolerant can add considerable performance overhead to the application. In this paper, we focus on active replication as an approach to provide fault-tolerance to ESP operators. More precisely, we address the performance costs of active replication for operators in distributed ESP applications.We use a speculation mechanism based on software transactional memory (STM) to achieve the following goals: (i) enable replicas to make progress using optimistic delivery; (ii) enable early forwarding of speculative computation results; (iii) enable active replication of multi-threaded operators using transactional executions. Experimental evaluation shows that, using this combination of mechanisms, one can implement highly efficient fault-tolerant ESP operators.
  • Keywords
    multi-threading; software fault tolerance; transaction processing; event stream processing; fault-tolerance; multithreading-enabled active replication; software transactional memory; Application software; Broadcasting; Costs; Delay; Electrostatic precipitators; Fault tolerance; Monitoring; Reliability engineering; Sensor systems and applications; Systems engineering and theory; active replication; distributed systems; event processing; fault-tolerance; parallel computing; speculation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Reliable Distributed Systems, 2009. SRDS '09. 28th IEEE International Symposium on
  • Conference_Location
    Niagara Falls, NY
  • ISSN
    1060-9857
  • Print_ISBN
    978-0-7695-3826-6
  • Type

    conf

  • DOI
    10.1109/SRDS.2009.37
  • Filename
    5283513