A Methodology to Discover Semantic Features from Textual Resources

Author

Vicient, Carlos ; Sanchez, Dominick ; Moreno, Antonio

Author_Institution

Dept. d´´Eng. Inf. i Mat., Univ. Rovira i Virgili, Tarragona, Spain

fYear

2011

fDate

1-2 Dec. 2011

Firstpage

39

Lastpage

44

Abstract

Data analysis algorithms focused on processing textual data rely on the extraction of relevant features from text and the appropriate association to their formal semantics. In this paper, a method to assist this task, annotating extracted textual features with concepts from a background ontology, is presented. The method is automatic and unsupervised and it has been designed in a generic way, so it can be applied to textual resources ranging from plain text to semi-structured resources (like Wikipedia articles). The system has been tested with tourist destinations and Wikipedia articles showing promising results.

Keywords

data analysis; feature extraction; ontologies (artificial intelligence); travel industry; Wikipedia articles; background ontology; data analysis algorithms; formal semantic features discovery; textual data processing; textual feature extraction annotation; textual resources; tourist destinations; Electronic publishing; Encyclopedias; Feature extraction; Internet; Ontologies; Semantics; Feature discovery; Information Extraction; Ontologie; Wikipedia;

fLanguage

English

Publisher

ieee

Conference_Titel

Semantic Media Adaptation and Personalization (SMAP), 2011 Sixth International Workshop on

Conference_Location

Pontevedra

Print_ISBN

978-1-4577-1372-9

Type

conf

DOI

10.1109/SMAP.2011.13

Filename

6103500