Title : 
An Ontology-Based Approach to Text Summarization
         
        
            Author : 
Hennig, Leonhard ; Umbrath, Winfried ; Wetzker, Robert
         
        
            Author_Institution : 
DAI Labor, Tech. Univ. Berlin, Berlin
         
        
        
        
        
        
        
            Abstract : 
Extractive text summarization aims to create a condensed version of one or more source documents by selecting the most informative sentences. Research in text summarization has therefore often focused on measures of the usefulness of sentences for a summary. We present an approach to sentence extraction that maps sentences to nodes of a hierarchical ontology. By considering ontology attributes we are able to improve the semantic representation of a sentence´s information content. The classifier that maps sentences to the taxonomy is trained using search engines and is therefore very flexible and not bound to a specific domain. In our experiments, we train an SVM classifier to identify summary sentences using ontology-based sentence features. Our experimental results show that the ontology-based extraction of sentences outperforms baseline classifiers, leading to higher Rouge scores of summary extracts.
         
        
            Keywords : 
feature extraction; ontologies (artificial intelligence); pattern classification; search engines; support vector machines; text analysis; SVM classifier; extractive text summarization; hierarchical ontology; search engines; semantic representation; Abstracts; Data mining; Intelligent agent; Ontologies; Search engines; Support vector machine classification; Support vector machines; Taxonomy; Tornadoes; Tropical cyclones; hierarchical classification; ontology; sentence extraction; summarization;
         
        
        
        
            Conference_Titel : 
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
         
        
            Conference_Location : 
Sydney, NSW
         
        
            Print_ISBN : 
978-0-7695-3496-1
         
        
        
            DOI : 
10.1109/WIIAT.2008.175