DocumentCode
2852070
Title
Automatic process model discovery from textual methodologies
Author
Viorica Epure, Elena ; Martin-Rodilla, Patricia ; Hug, Charlotte ; Deneckere, Rebecca ; Salinesi, Camille
Author_Institution
Centre de Rech. en Inf., Univ. Paris 1 Pantheon-Sorbonne, Paris, France
fYear
2015
fDate
13-15 May 2015
Firstpage
19
Lastpage
30
Abstract
Process mining has been successfully used in automatic knowledge discovery and in providing guidance or support. The known process mining approaches rely on processes being executed with the help of information systems thus enabling the automatic capture of process traces as event logs. However, there are many other fields such as Humanities, Social Sciences and Medicine where workers follow processes and log their execution manually in textual forms instead. The problem we tackle in this paper is mining process instance models from unstructured, text-based process traces. Using natural language processing with a focus on the verb semantics, we created a novel unsupervised technique TextProcessMiner that discovers process instance models in two steps: 1.ActivityMiner mines the process activities; 2.ActivityRelationshipMiner mines the sequence, parallelism and mutual exclusion relationships between activities. We employed technical action research through which we validated and preliminarily evaluated our proposed technique in an Archaeology case. The results are very satisfactory with 88% correctly discovered activities in the log and a process instance model that adequately reflected the original process. Moreover, the technique we created emerged as domain independent.
Keywords
archaeology; data mining; information systems; natural language processing; text analysis; unsupervised learning; ActivityRelationshipMiner; TextProcessMiner; archaeology; automatic knowledge discovery; automatic process trace capture; event logs; information systems; mining process instance model; mutual exclusion relationships; natural language processing; parallelism; process instance model; process mining approach; unstructured text-based process traces; unsupervised technique; verb semantics; Information systems; Machining; Manuals; Natural language processing; Semantics; Substrates; natural language processing; process mining; process mining technique; process model; technical action research;
fLanguage
English
Publisher
ieee
Conference_Titel
Research Challenges in Information Science (RCIS), 2015 IEEE 9th International Conference on
Conference_Location
Athens
Type
conf
DOI
10.1109/RCIS.2015.7128860
Filename
7128860
Link To Document