Title :
Casablanca: Towards a Dataset-Independent Tool for Semantic Web Annotation
Author :
Bit-Monnot, Arthur ; Vivacqua, Adriana S. ; Da Silva, Joao Carlos P.
Author_Institution :
INSA, Univ. de Toulouse, Toulouse, France
Abstract :
Semantic Web Annotation is the process of mapping named entities that appear in free-text to universal identifiers, typically made available by a Linked Data provider such as DBpedia or Freebase. This is a time consuming process, as it is usually performed by human experts with little or no computational assistance. This paper describes Casablanca, a tool built in order to make Semantic Web Annotation an accessible process for any dataset. Casablanca´s approach is to request a minimal configuration describing a dataset. It then provides unified programming and user interfaces to perform annotation based on this configuration. To perform the necessary disambiguation between named entities, Casablanca relies on existing data available in the targeted dataset.
Keywords :
natural language processing; semantic Web; user interfaces; Casablanca; dataset-independent tool; minimal configuration; named entity disambiguation; semantic Web annotation; unified programming; user interface; Electronic publishing; Encyclopedias; Manuals; Semantic Web; Semantics; XML; named entity disambiguation; semantic web; web annotation;
Conference_Titel :
Semantic Computing (ICSC), 2013 IEEE Seventh International Conference on
Conference_Location :
Irvine, CA
DOI :
10.1109/ICSC.2013.52