• DocumentCode
    2653233
  • Title

    A Methodology to Discover Semantic Features from Textual Resources

  • Author

    Vicient, Carlos ; Sanchez, Dominick ; Moreno, Antonio

  • Author_Institution
    Dept. d´´Eng. Inf. i Mat., Univ. Rovira i Virgili, Tarragona, Spain
  • fYear
    2011
  • fDate
    1-2 Dec. 2011
  • Firstpage
    39
  • Lastpage
    44
  • Abstract
    Data analysis algorithms focused on processing textual data rely on the extraction of relevant features from text and the appropriate association to their formal semantics. In this paper, a method to assist this task, annotating extracted textual features with concepts from a background ontology, is presented. The method is automatic and unsupervised and it has been designed in a generic way, so it can be applied to textual resources ranging from plain text to semi-structured resources (like Wikipedia articles). The system has been tested with tourist destinations and Wikipedia articles showing promising results.
  • Keywords
    data analysis; feature extraction; ontologies (artificial intelligence); travel industry; Wikipedia articles; background ontology; data analysis algorithms; formal semantic features discovery; textual data processing; textual feature extraction annotation; textual resources; tourist destinations; Electronic publishing; Encyclopedias; Feature extraction; Internet; Ontologies; Semantics; Feature discovery; Information Extraction; Ontologie; Wikipedia;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantic Media Adaptation and Personalization (SMAP), 2011 Sixth International Workshop on
  • Conference_Location
    Pontevedra
  • Print_ISBN
    978-1-4577-1372-9
  • Type

    conf

  • DOI
    10.1109/SMAP.2011.13
  • Filename
    6103500