DocumentCode
2653233
Title
A Methodology to Discover Semantic Features from Textual Resources
Author
Vicient, Carlos ; Sanchez, David ; Moreno, Antonio
Author_Institution
Dept. d'Eng. Inf. i Mat., Univ. Rovira i Virgili, Tarragona, Spain
fYear
2011
fDate
1-2 Dec. 2011
Firstpage
39
Lastpage
44
Abstract
Data analysis algorithms that process textual data rely on the extraction of relevant features from text and on the appropriate association of those features with their formal semantics. This paper presents a method to assist this task by annotating extracted textual features with concepts from a background ontology. The method is automatic and unsupervised, and it has been designed in a generic way, so it can be applied to textual resources ranging from plain text to semi-structured resources (such as Wikipedia articles). The system has been tested with tourist destinations and Wikipedia articles, showing promising results.
Keywords
data analysis; feature extraction; ontologies (artificial intelligence); travel industry; Wikipedia articles; background ontology; data analysis algorithms; formal semantic features discovery; textual data processing; textual feature extraction annotation; textual resources; tourist destinations; Electronic publishing; Encyclopedias; Feature extraction; Internet; Ontologies; Semantics; Feature discovery; Information Extraction; Ontologies; Wikipedia
fLanguage
English
Publisher
IEEE
Conference_Titel
Semantic Media Adaptation and Personalization (SMAP), 2011 Sixth International Workshop on
Conference_Location
Pontevedra
Print_ISBN
978-1-4577-1372-9
Type
conf
DOI
10.1109/SMAP.2011.13
Filename
6103500