• DocumentCode
    3046654
  • Title

    Analysis of Automated Evaluation for Multi-document Summarization Using Content-Based Similarity

  • Author

    Qiu, Li-Qing ; Pang, Bin

  • Author_Institution
    Beihang Univ., Beijing
  • fYear
    2008
  • fDate
    10-15 Feb. 2008
  • Firstpage
    60
  • Lastpage
    63
  • Abstract
    We introduce an automated evaluation method based on content similarity, and construct a vector space of words, on which we compute cosine similarity of automated summaries and human summaries. The method is tested on DUC 2005 data, and produces acceptable results, which may avoid some shortcomings of n-gram. We also test the effects of stopwords and stemming.
  • Keywords
    document handling; DUC 2005 data; automated evaluation analysis; content-based similarity; multidocument summarization; Costs; Functional analysis; Humans; Large-scale systems; NIST; Natural languages; Performance evaluation; Programming; Testing; Vocabulary; automated evaluation; content -based similarity; multi-document summarization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Society, 2008 Second International Conference on the
  • Conference_Location
    Sainte Luce
  • Print_ISBN
    978-0-7695-3087-1
  • Type

    conf

  • DOI
    10.1109/ICDS.2008.9
  • Filename
    4456020