• DocumentCode
    570232
  • Title
    Combining a double clustering approach with sentence simplification to produce highly informative multi-document summaries
  • Author
    Silveira, Sara Botelho; Branco, António

  • Author_Institution
    Dept. de Inf., Univ. of Lisbon, Lisbon, Portugal
  • fYear
    2012
  • fDate
    8-10 Aug. 2012
  • Firstpage
    482
  • Lastpage
    489
  • Abstract
    This paper presents a method for extractive multi-document summarization that combines a two-phase clustering approach with a sentence simplification procedure, aiming to generate more useful summaries. First, sentences are clustered by similarity and one sentence per cluster is selected, which reduces redundancy. Then, to group them by topic, the selected sentences are clustered based on the collection of keywords. Finally, a sentence simplification step aims not only to create simpler and more incisive sentences, but also to make room for further relevant content in the summary. Evaluation reveals that the approach produces highly informative summaries, containing relevant data and no repeated information.
  • Keywords
    pattern clustering; text analysis; double clustering approach; multidocument summarization; sentence simplification; two-phase clustering approach; Abstracts; Clustering algorithms; Humans; Measurement; Organizations; Pragmatics; Redundancy;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information Reuse and Integration (IRI), 2012 IEEE 13th International Conference on
  • Conference_Location
    Las Vegas, NV
  • Print_ISBN
    978-1-4673-2282-9
  • Electronic_ISBN
    978-1-4673-2283-6
  • Type
    conf
  • DOI
    10.1109/IRI.2012.6303047
  • Filename
    6303047
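
The pipeline outlined in the abstract (similarity clustering, one sentence per cluster, keyword-based topic grouping, simplification) can be illustrated with a minimal sketch. This is not the authors' implementation: the bag-of-words cosine measure, the greedy clustering with a 0.5 threshold, the "first keyword mentioned" topic rule, and the parenthetical-dropping simplifier are all assumptions chosen for brevity, and every function name below is hypothetical.

```python
import math
import re
from collections import Counter


def tokens(sentence):
    """Lowercase alphabetic word tokens of a sentence."""
    return re.findall(r"[a-z]+", sentence.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster_by_similarity(sentences, threshold=0.5):
    """Phase 1 (assumed greedy scheme): a sentence joins the first cluster
    whose first member is similar enough, otherwise it starts a new one."""
    clusters = []
    for s in sentences:
        vec = Counter(tokens(s))
        for c in clusters:
            if cosine(vec, Counter(tokens(c[0]))) >= threshold:
                c.append(s)
                break
        else:
            clusters.append([s])
    return clusters


def pick_representatives(clusters):
    """Keep one sentence per cluster (here simply the first) to cut redundancy."""
    return [c[0] for c in clusters]


def cluster_by_keyword(sentences, keywords):
    """Phase 2 (assumed rule): group sentences under the first keyword they
    mention; sentences mentioning none are grouped under None."""
    topics = {}
    for s in sentences:
        words = set(tokens(s))
        key = next((k for k in keywords if k in words), None)
        topics.setdefault(key, []).append(s)
    return topics


def simplify(sentence):
    """Toy simplification: drop parenthetical asides to free up space."""
    return re.sub(r"\s*\([^)]*\)", "", sentence)


if __name__ == "__main__":
    docs = [
        "The storm hit the coast on Monday.",
        "On Monday the storm hit the coast.",
        "Rescue teams (about forty people) reached the village.",
    ]
    reps = pick_representatives(cluster_by_similarity(docs))
    for topic, group in cluster_by_keyword(reps, ["storm", "rescue"]).items():
        print(topic, [simplify(s) for s in group])
```

In this sketch the two storm sentences share the same bag of words, so they fall into one cluster and only the first survives; the remaining sentences are then grouped by topic and shortened before they would be placed in a summary.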