DocumentCode :
2353785
Title :
CEC: Continuous eventual checkpointing for data stream processing operators
Author :
Sebepou, Zoe ; Magoutis, Kostas
Author_Institution :
Inst. of Comput. Sci. (ICS), Found. for Res. & Technol.-Hellas (FORTH), Heraklion, Greece
fYear :
2011
fDate :
27-30 June 2011
Firstpage :
145
Lastpage :
156
Abstract :
The checkpoint roll-backward methodology is the underlying technology of several fault-tolerance solutions for continuous stream processing systems today, implemented either using the memories of replica nodes or a distributed file system. In this scheme the recovering node loads its most recent checkpoint and requests log replay to reach a consistent pre-failure state. Challenges with that technique include its complexity (typically implemented via copy-on-write), the associated overhead (exception handling under state updates), and limits to the frequency of checkpointing. The latter limit affects the amount of information that needs to be replayed leading to long recovery times. In this work we introduce continuous eventual checkpointing (CEC), a novel mechanism to provide fault-tolerance guarantees by taking continuous incremental state checkpoints with minimal pausing of operator processing. We achieve this by separating operator state into independent parts and producing frequent independent partial checkpoints of them. Our results show that our method can achieve low overhead fault-tolerance with adjustable checkpoint intensity, trading off recovery time with performance.
Keywords :
checkpointing; fault tolerant computing; network operating systems; CEC; checkpoint roll-backward methodology; continuous eventual checkpointing; continuous stream processing systems; data stream processing operators; distributed file system; Aggregates; Checkpointing; Complexity theory; Fault tolerance; Fault tolerant systems; Instruction sets; Production; Continuous Stream Processing; Fault-Tolerance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Dependable Systems & Networks (DSN), 2011 IEEE/IFIP 41st International Conference on
Conference_Location :
Hong Kong
ISSN :
1530-0889
Print_ISBN :
978-1-4244-9232-9
Electronic_ISBN :
1530-0889
Type :
conf
DOI :
10.1109/DSN.2011.5958214
Filename :
5958214
Link To Document :
بازگشت