Title :
Is my event log complete? — A probabilistic approach to process mining
Author :
Van Hee, Kees M. ; Liu, Zheng ; Sidorova, Natalia
Author_Institution :
Dept. of Math. & Comput. Sci., Eindhoven Univ. of Technol., Eindhoven, Netherlands
Abstract :
Process mining is a technique for extracting process models from event logs recorded by information systems. Process mining approaches normally rely on the assumption that the log to be mined is complete. Checking log completeness is known to be a difficult issue. Except for some trivial cases, checkable criteria for log completeness are not known. We overcome this problem by taking a probabilistic point of view. In this paper, we propose a method to compute the probability that the event log is complete. Our method provides a probabilistic lower bound for log completeness for three subclasses of Petri nets, namely, workflow nets, T-workflow nets, and S-workflow nets. Furthermore, based upon the complete log obtained by our methods, we propose two specialized mining algorithms to discover T-workflow nets and S-workflow nets, respectively. We back up our theoretical work with empirical studies that show that the probabilistic bounds computed by our method are reliable.
Keywords :
Petri nets; data mining; information systems; probability; Petri nets; S-workflow nets; T-workflow nets; information systems; probabilistic approach; probabilistic bounds; process mining; Computational modeling; Data mining; Manganese; Parallel processing; Petri nets; Probabilistic logic; Random variables; Petri nets; event log; probabilistic analysis; process mining; workflow management;
Conference_Titel :
Research Challenges in Information Science (RCIS), 2011 Fifth International Conference on
Conference_Location :
Gosier
Print_ISBN :
978-1-4244-8670-0
Electronic_ISBN :
2151-1349
DOI :
10.1109/RCIS.2011.6006848