Title :
A principled approach to mining from noisy logs using Heuristics Miner
Author :
Weber, Piotr ; Bordbar, Behzad ; Tino, Peter
Author_Institution :
Sch. of Comput. Sci., Univ. of Birmingham, Birmingham, UK
Abstract :
Noise is a challenge for process mining algorithms, but there is no standard definition of noise nor accepted way to quantify it. This means it is not possible to mine with confidence from event logs which may not record the underlying process correctly. We discuss one way of thinking about noise in process mining. We consider mining from a `noisy log´ as learning a probability distribution over traces, representing the true process, from a log which is a sample from multiple distributions: the `true´ process model and one or more `noise´ models. We apply this using a probabilistic analysis of the Heuristics Miner algorithm, and demonstrate on a simple example. We show that for a given model it is possible to predict how much data is needed to mine the underlying model without the noise, and identify differences in the the robustness of Heuristics Miner to different types of noise.
Keywords :
data mining; learning (artificial intelligence); probability; event logs; heuristics miner algorithm; noisy logs; principled approach; probabilistic analysis; probability distribution; process mining algorithms; Algorithm design and analysis; Business; Data mining; Joints; Noise; Noise measurement; Probabilistic logic;
Conference_Titel :
Computational Intelligence and Data Mining (CIDM), 2013 IEEE Symposium on
Conference_Location :
Singapore
DOI :
10.1109/CIDM.2013.6597226