DocumentCode :
710094
Title :
Cleaning structured event logs: A graph repair approach
Author :
Jianmin Wang ; Shaoxu Song ; Xuemin Lin ; Xiaochen Zhu ; Jian Pei
Author_Institution :
Sch. of Software, Tsinghua Univ., Beijing, China
fYear :
2015
fDate :
13-17 April 2015
Firstpage :
30
Lastpage :
41
Abstract :
Event data are often dirty owing to various recording conventions or simply system errors. These errors may cause many serious damages to real applications, such as inaccurate provenance answers, poor profiling results or concealing interesting patterns from event data. Cleaning dirty event data is strongly demanded. While existing event data cleaning techniques view event logs as sequences, structural information do exist among events. We argue that such structural information enhances not only the accuracy of repairing inconsistent events but also the computation efficiency. It is notable that both the structure and the names (labeling) of events could be inconsistent. In real applications, while unsound structure is not repaired automatically (which needs manual effort from business actors to handle the structure error), it is highly desirable to repair the inconsistent event names introduced by recording mistakes. In this paper, we propose a graph repair approach for 1) detecting unsound structure, and 2) repairing inconsistent event name.
Keywords :
business data processing; data handling; graph theory; business actors; computation efficiency; dirty event data cleaning techniques; graph repair approach; inconsistent event name repairing; sequences; structural information; structured event log cleaning; system errors; unsound structure detection; Approximation algorithms; Cleaning; Databases; Insulation; Labeling; Maintenance engineering; Petri nets;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2015 IEEE 31st International Conference on
Conference_Location :
Seoul
Type :
conf
DOI :
10.1109/ICDE.2015.7113270
Filename :
7113270
Link To Document :
بازگشت