Title :
Extracting and enriching workflows from text
Author :
Schumacher, Pol ; Minor, Mirjam ; Schulte-Zurhausen, Eric
Author_Institution :
Inst. fur Inf., Goethe Univ. Frankfurt, Frankfurt am Main, Germany
Abstract :
This paper is on a workflow extraction framework which allows to derive a formal representation based on workflows from textual descriptions of instructions, for instance, of aircraft repair procedures from a maintenance manual. The framework applies a pipes-and-filters architecture and uses NLP (Natural Language Processing) tools to perform information extraction steps automatically. In detail, the paper presents on the step of anaphora resolution to enrich the workflow extracted so far. We introduce a lexical approach and two further approaches based on a set of association rules which are created during a statistical analysis of a corpus of workflows. The results of the approaches are compared to each other. For the evaluation, we use 37 workflows which have been created by a human expert.
Keywords :
business data processing; data mining; information retrieval; natural language processing; statistical analysis; text analysis; NLP tools; aircraft repair procedures; anaphora resolution; association rules; business process; formal representation; information extraction steps; lexical approach; natural language processing tools; pipes-and-filters architecture; statistical analysis; textual process descriptions; workflow extraction framework; Aircraft; Cognition; Maintenance engineering; Natural language processing; Pipelines; Pragmatics; Rockets;
Conference_Titel :
Information Reuse and Integration (IRI), 2013 IEEE 14th International Conference on
Conference_Location :
San Francisco, CA
DOI :
10.1109/IRI.2013.6642484