Title :
Automatic process model discovery from textual methodologies
Author :
Viorica Epure, Elena ; Martin-Rodilla, Patricia ; Hug, Charlotte ; Deneckere, Rebecca ; Salinesi, Camille
Author_Institution :
Centre de Rech. en Inf., Univ. Paris 1 Pantheon-Sorbonne, Paris, France
Abstract :
Process mining has been successfully used in automatic knowledge discovery and in providing guidance or support. The known process mining approaches rely on processes being executed with the help of information systems thus enabling the automatic capture of process traces as event logs. However, there are many other fields such as Humanities, Social Sciences and Medicine where workers follow processes and log their execution manually in textual forms instead. The problem we tackle in this paper is mining process instance models from unstructured, text-based process traces. Using natural language processing with a focus on the verb semantics, we created a novel unsupervised technique TextProcessMiner that discovers process instance models in two steps: 1.ActivityMiner mines the process activities; 2.ActivityRelationshipMiner mines the sequence, parallelism and mutual exclusion relationships between activities. We employed technical action research through which we validated and preliminarily evaluated our proposed technique in an Archaeology case. The results are very satisfactory with 88% correctly discovered activities in the log and a process instance model that adequately reflected the original process. Moreover, the technique we created emerged as domain independent.
Keywords :
archaeology; data mining; information systems; natural language processing; text analysis; unsupervised learning; ActivityRelationshipMiner; TextProcessMiner; archaeology; automatic knowledge discovery; automatic process trace capture; event logs; information systems; mining process instance model; mutual exclusion relationships; natural language processing; parallelism; process instance model; process mining approach; unstructured text-based process traces; unsupervised technique; verb semantics; Information systems; Machining; Manuals; Natural language processing; Semantics; Substrates; natural language processing; process mining; process mining technique; process model; technical action research;
Conference_Titel :
Research Challenges in Information Science (RCIS), 2015 IEEE 9th International Conference on
Conference_Location :
Athens
DOI :
10.1109/RCIS.2015.7128860