Title :
Efficient Process Discovery From Event Streams Using Sequential Pattern Mining
Author :
Marwan Hassani;Sergio Siccha;Florian Richter;Thomas Seidl
Author_Institution :
Data Manage. &
Abstract :
Process mining is an emerging research area that applies the well-established data mining solutions to the challenging business process modeling problems. Mining streams of business processes in the real time as they are generated is a necessity to obtain an instant knowledge from big process data. In this paper, we introduce an efficient approach for exploring and counting process fragments from a stream of events to infer a process model using the Heuristics Miner algorithm. Our novel approach, called StrProM, builds prefix-trees to extract sequential patterns of events from the stream. StrProM uses a batch-based approach to continuously update and prune these prefix-trees. The final models are generated from those trees after applying a novel decaying mechanism over their statistics. The extensive experimental evaluation demonstrates the superiority of our approach over a state-of-the-art technique in terms of execution time using a real dataset, while delivering models of a comparable quality.
Keywords :
"Data mining","Business","Heuristic algorithms","Adaptation models","Context modeling","Process modeling","Real-time systems"
Conference_Titel :
Computational Intelligence, 2015 IEEE Symposium Series on
Print_ISBN :
978-1-4799-7560-0
DOI :
10.1109/SSCI.2015.195