DocumentCode :
3244782
Title :
EROS: an efficient method for minimizing the replay time based on the replay dependence relation
Author :
Thoai, Nam ; Kranzlmüller, Dieter ; Volkert, Jens
Author_Institution :
Johannes Kepler Univ., Linz, Austria
fYear :
2003
fDate :
5-7 Feb. 2003
Firstpage :
23
Lastpage :
30
Abstract :
Debugging parallel programs is still a challenge, although many debugging techniques and corresponding tools have been developed. One reason why parallel programs are difficult to debug is the irreproducibility effect, which is caused by nondeterminism occurring at process interaction. Solutions to this problem are provided by so-called record and replay mechanisms, or replay techniques. However, the rather long waiting time during replay often prohibits the inclusion of these techniques in debugging tools, which must provide some degree of interactivity for the user's investigations. This paper analyzes the possibility of combining debugging with checkpointing to shorten the waiting time during re-executions. Related work in this area either cannot ensure a short waiting time or restricts the autonomy of processes. The EROS approach introduces the replay dependence relation, which allows us to minimize the waiting time without the restrictions above. In fact, EROS provides a small upper bound on the replay time when debugging nondeterministic message passing programs.
Keywords :
message passing; parallel programming; program debugging; system recovery; EROS; checkpointing; debugging; interactivity; irreproducibility effect; nondeterministic message passing programs; parallel programs; record and replay mechanisms; replay dependence relation; replay time minimization; upper bound; Checkpointing; Computer bugs; Debugging; Europe; Fault tolerance; Large-scale systems; Message passing; Upper bound;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Parallel, Distributed and Network-Based Processing, 2003. Proceedings. Eleventh Euromicro Conference on
Conference_Location :
Genova, Italy
ISSN :
1066-6192
Print_ISBN :
0-7695-1875-3
Type :
conf
DOI :
10.1109/EMPDP.2003.1183561
Filename :
1183561