DocumentCode :
713931
Title :
Iterative joint extraction of entities, relationships and coreferences from text sources
Author :
Zitnik, Slavko ; Bajec, Marko
Author_Institution :
Fac. of Comput. & Inf. Sci., Univ. of Ljubljana, Ljubljana, Slovenia
fYear :
2015
fDate :
13-15 May 2015
Firstpage :
412
Lastpage :
422
Abstract :
Machine understanding of textual documents has been challenging since the early computer era. Since the information extraction research field emerged, it has encompassed multiple natural language processing tasks, such as named entity recognition, relationship extraction and coreference resolution. Even though all three tasks are crucial for end-to-end information extraction, existing work has focused on one specific task at a time or, at best, on their connection in a pipeline. In this paper we introduce a novel iterative and joint information extraction system that interconnects all three tasks using iterative feature functions, which take advantage of the intermediate extractions. Furthermore, we introduce a special transformation of data into skip-mention sequences to enable the extraction of relations and coreferences using fast first-order graphical models. Additionally, the system uses an ontology as its knowledge source, as a list of inferred extraction rules, and as a data schema for extracted results. Experimental results show that the accuracy of extractions improves after each iteration. In particular, our model obtained a 15% error reduction on named entity recognition over individual models.
Keywords :
iterative methods; natural language processing; ontologies (artificial intelligence); text analysis; coreference resolution; end-to-end information extraction; first-order graphical model; iterative feature function; iterative joint extraction; machine understanding; multiple natural language processing task; named entity recognition; ontology; relationships extraction; textual document; Data mining; Feature extraction; Information retrieval; Joints; Ontologies; Organizations; Training;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Research Challenges in Information Science (RCIS), 2015 IEEE 9th International Conference on
Conference_Location :
Athens
Type :
conf
DOI :
10.1109/RCIS.2015.7128902
Filename :
7128902