• DocumentCode
    2282309
  • Title

    An Ontology-Based Approach to Text Summarization

  • Author

    Hennig, Leonhard ; Umbrath, Winfried ; Wetzker, Robert

  • Author_Institution
    DAI Labor, Tech. Univ. Berlin, Berlin
  • Volume
    3
  • fYear
    2008
  • fDate
    9-12 Dec. 2008
  • Firstpage
    291
  • Lastpage
    294
  • Abstract
    Extractive text summarization aims to create a condensed version of one or more source documents by selecting the most informative sentences. Research in text summarization has therefore often focused on measures of the usefulness of sentences for a summary. We present an approach to sentence extraction that maps sentences to nodes of a hierarchical ontology. By considering ontology attributes we are able to improve the semantic representation of a sentence´s information content. The classifier that maps sentences to the taxonomy is trained using search engines and is therefore very flexible and not bound to a specific domain. In our experiments, we train an SVM classifier to identify summary sentences using ontology-based sentence features. Our experimental results show that the ontology-based extraction of sentences outperforms baseline classifiers, leading to higher Rouge scores of summary extracts.
  • Keywords
    feature extraction; ontologies (artificial intelligence); pattern classification; search engines; support vector machines; text analysis; SVM classifier; extractive text summarization; hierarchical ontology; search engines; semantic representation; Abstracts; Data mining; Intelligent agent; Ontologies; Search engines; Support vector machine classification; Support vector machines; Taxonomy; Tornadoes; Tropical cyclones; hierarchical classification; ontology; sentence extraction; summarization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
  • Conference_Location
    Sydney, NSW
  • Print_ISBN
    978-0-7695-3496-1
  • Type

    conf

  • DOI
    10.1109/WIIAT.2008.175
  • Filename
    4740782