• DocumentCode
    3310369
  • Title

    Automatically extracting summaries with a novel unsupervised framework

  • Author

    Peng Li ; Yinglin Wang

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., Shanghai, China
  • Volume
    3
  • fYear
    2011
  • fDate
    26-28 July 2011
  • Firstpage
    1778
  • Lastpage
    1782
  • Abstract
    In this paper, we propose a novel approach to automatic generation of aspect-oriented summaries from multiple documents. We first develop an event-aspect LDA model to cluster sentences into aspects. We then use extended LexRank algorithm to rank the sentences in each cluster. We use Integer Linear Programming for sentence selection. Key features of our method include automatic grouping of semantically related sentences and sentence ranking based on extension of random walk model. Also, we implement a new sentence compression algorithm which use dependency tree instead of parser tree. We compare our method with three baseline methods. Quantitative evaluation based on Rouge metric demonstrate the effectiveness and advantages of our method.
  • Keywords
    aspect-oriented programming; document handling; grammars; integer programming; linear programming; pattern clustering; unsupervised learning; LexRank algorithm; aspect-oriented summaries; cluster sentences; event-aspect LDA model; integer linear programming; multidocument summarization; parser tree; random walk model; sentence compression algorithm; sentence ranking; unsupervised framework; Clustering algorithms; Compression algorithms; Computational linguistics; Computational modeling; Integer linear programming; Pragmatics; Redundancy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-61284-180-9
  • Type

    conf

  • DOI
    10.1109/FSKD.2011.6019843
  • Filename
    6019843