DocumentCode
570232
Title
Combining a double clustering approach with sentence simplification to produce highly informative multi-document summaries
Author
Silveira, Sara Botelho ; Branco, António
Author_Institution
Dept. de Inf., Univ. of Lisbon, Lisbon, Portugal
fYear
2012
fDate
8-10 Aug. 2012
Firstpage
482
Lastpage
489
Abstract
This paper presents a method for extractive multi-document summarization that combines a two-phase clustering approach with a sentence simplification procedure, aiming to generate more useful summaries. First, sentences are clustered by similarity, and one sentence per cluster is selected to reduce redundancy. Then, the selected sentences are clustered according to their keywords, in order to group them by topic. Finally, the summarization process includes a sentence simplification step, which aims not only to create simpler and more incisive sentences, but also to make room for further relevant content in the summary. Evaluation reveals that this approach produces highly informative summaries, containing relevant data and no repeated information.
Keywords
pattern clustering; text analysis; double clustering approach; multidocument summarization; sentence simplification; two-phase clustering approach; Abstracts; Clustering algorithms; Humans; Measurement; Organizations; Pragmatics; Redundancy
fLanguage
English
Publisher
IEEE
Conference_Titel
Information Reuse and Integration (IRI), 2012 IEEE 13th International Conference on
Conference_Location
Las Vegas, NV
Print_ISBN
978-1-4673-2282-9
Electronic_ISBN
978-1-4673-2283-6
Type
conf
DOI
10.1109/IRI.2012.6303047
Filename
6303047