DocumentCode :
2875485
Title :
Lightweight Task Graph Inference for Distributed Applications
Author :
Xin, Bin ; Eugster, Patrick ; Zhang, Xiangyu ; Yang, Jinlin
Author_Institution :
Dept. of Comput. Sci., Purdue Univ., West Lafayette, IN, USA
fYear :
2010
fDate :
Oct. 31 2010-Nov. 3 2010
Firstpage :
100
Lastpage :
110
Abstract :
Recent paradigm shifts in distributed computing such as the advent of cloud computing pose new challenges to the analysis of distributed executions. One important new characteristic is that the management staff of computing platforms and the developers of applications are separated by corporate boundaries. The net result is that once applications go wrong, the most readily available debugging aids for developers are the visible output of the application and any log files collected during its execution. In this paper, we propose the concept of task graphs as a foundation to represent distributed executions, and present a low-overhead algorithm to infer task graphs from event log files. Intuitively, a task represents an autonomous segment of computation inside a thread. Edges between tasks represent their interactions and preserve programmers' notion of data and control flows. Our technique leverages existing logging support where available or otherwise augments it with aspect-based instrumentation to collect events of a set of predefined types. We show how task graphs can improve the precision of anomaly detection in a request-oriented analysis of field software and help programmers understand the running of the Hadoop Distributed File System (HDFS).
Keywords :
Internet; aspect-oriented programming; data flow analysis; graph theory; inference mechanisms; Hadoop distributed file system; aspect based instrumentation; cloud computing; control flows; data flows; debugging aids; distributed computing; field software; task graph inference; Clocks; Distributed databases; Instruction sets; Java; Programming; Sockets; Synchronization; anomaly detection; distributed computing; happens-before; log analysis; task graphs;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Reliable Distributed Systems, 2010 29th IEEE Symposium on
Conference_Location :
New Delhi
ISSN :
1060-9857
Print_ISBN :
978-0-7695-4250-8
Type :
conf
DOI :
10.1109/SRDS.2010.20
Filename :
5623382