Title :
The Case for Distributed Execution Replay Using a Virtual Machine
Author :
Loriant, Nicolas ; Menaud, Jean-Marc
Author_Institution :
Obasco Group, EMN-INRIA, Nantes
Abstract :
Debugging grid systems is complex, mainly because of the probe effect and non reproducible execution. The probe effect arises when an attempt to monitor a system changes the behavior of that system. Moreover, two executions of a distributed system with identical inputs may behave differently due to non determinism. Execution replay is a technique developed to facilitate the debugging of distributed systems: a debugger first monitors the execution of a distributed system and then replays it identically. Existing approaches to execution replay only partially address the probe effect and irreproducibility problem. In this paper, we argue for execution replay of distributed systems using a virtual machine approach. The VM approach addresses the irreproducibility problem it does not completely avoid the probe effect. Nevertheless, we believe that the full control of the virtual hardware addresses the probe issue well enough to debug distributed system errors
Keywords :
grid computing; program debugging; virtual machines; distributed system; execution replay debugger; grid system; irreproducibility problem; virtual machine; Context; Control systems; Debugging; Error correction; Hardware; Monitoring; Probes; Random number generation; Virtual machining; Virtual manufacturing;
Conference_Titel :
Enabling Technologies: Infrastructure for Collaborative Enterprises, 2006. WETICE '06. 15th IEEE International Workshops on
Conference_Location :
Manchester
Print_ISBN :
0-7695-2623-3
DOI :
10.1109/WETICE.2006.66