Title :
Distributed genetic process mining
Author :
Bratosin, Carmen ; Sidorova, Natalia ; van der Aalst, Wil
Author_Institution :
Dept. of Math. & Comput. Sci., Eindhoven Univ. of Technol., Eindhoven, Netherlands
Abstract :
Process mining aims at discovering process models from data logs in order to offer insight into the real use of information systems. Most of the existing process mining algorithms fail to discover complex constructs or have problems dealing with noise and infrequent behavior. The genetic process mining algorithm overcomes these issues by using genetic operators to search for the fittest solution in the space of all possible process models. The main disadvantage of genetic process mining is the required computation time. In this paper we present a coarse-grained distributed variant of the genetic miner that reduces the computation time. The degree of the improvement obtained highly depends on the parameter values and event logs characteristics. We perform an empirical evaluation to determine guidelines for setting the parameters of the distributed algorithm.
Keywords :
data mining; distributed algorithms; genetic algorithms; information systems; coarse-grained distributed algorithm; data log; distributed genetic process mining; event log; genetic operator; information system; parameter value; process mining algorithm; Complexity theory; Computational modeling; Data mining; Genetics; Heuristic algorithms; IP networks; PROM;
Conference_Titel :
Evolutionary Computation (CEC), 2010 IEEE Congress on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4244-6909-3
DOI :
10.1109/CEC.2010.5586250