• DocumentCode
    2852070
  • Title

    Automatic process model discovery from textual methodologies

  • Author

    Viorica Epure, Elena ; Martin-Rodilla, Patricia ; Hug, Charlotte ; Deneckere, Rebecca ; Salinesi, Camille

  • Author_Institution
    Centre de Rech. en Inf., Univ. Paris 1 Pantheon-Sorbonne, Paris, France
  • fYear
    2015
  • fDate
    13-15 May 2015
  • Firstpage
    19
  • Lastpage
    30
  • Abstract
    Process mining has been successfully used in automatic knowledge discovery and in providing guidance or support. The known process mining approaches rely on processes being executed with the help of information systems thus enabling the automatic capture of process traces as event logs. However, there are many other fields such as Humanities, Social Sciences and Medicine where workers follow processes and log their execution manually in textual forms instead. The problem we tackle in this paper is mining process instance models from unstructured, text-based process traces. Using natural language processing with a focus on the verb semantics, we created a novel unsupervised technique TextProcessMiner that discovers process instance models in two steps: 1.ActivityMiner mines the process activities; 2.ActivityRelationshipMiner mines the sequence, parallelism and mutual exclusion relationships between activities. We employed technical action research through which we validated and preliminarily evaluated our proposed technique in an Archaeology case. The results are very satisfactory with 88% correctly discovered activities in the log and a process instance model that adequately reflected the original process. Moreover, the technique we created emerged as domain independent.
  • Keywords
    archaeology; data mining; information systems; natural language processing; text analysis; unsupervised learning; ActivityRelationshipMiner; TextProcessMiner; archaeology; automatic knowledge discovery; automatic process trace capture; event logs; information systems; mining process instance model; mutual exclusion relationships; natural language processing; parallelism; process instance model; process mining approach; unstructured text-based process traces; unsupervised technique; verb semantics; Information systems; Machining; Manuals; Natural language processing; Semantics; Substrates; natural language processing; process mining; process mining technique; process model; technical action research;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Research Challenges in Information Science (RCIS), 2015 IEEE 9th International Conference on
  • Conference_Location
    Athens
  • Type

    conf

  • DOI
    10.1109/RCIS.2015.7128860
  • Filename
    7128860