DocumentCode
3310369
Title
Automatically extracting summaries with a novel unsupervised framework
Author
Peng Li ; Yinglin Wang
Author_Institution
Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., Shanghai, China
Volume
3
fYear
2011
fDate
26-28 July 2011
Firstpage
1778
Lastpage
1782
Abstract
In this paper, we propose a novel approach to automatic generation of aspect-oriented summaries from multiple documents. We first develop an event-aspect LDA model to cluster sentences into aspects. We then use extended LexRank algorithm to rank the sentences in each cluster. We use Integer Linear Programming for sentence selection. Key features of our method include automatic grouping of semantically related sentences and sentence ranking based on extension of random walk model. Also, we implement a new sentence compression algorithm which use dependency tree instead of parser tree. We compare our method with three baseline methods. Quantitative evaluation based on Rouge metric demonstrate the effectiveness and advantages of our method.
Keywords
aspect-oriented programming; document handling; grammars; integer programming; linear programming; pattern clustering; unsupervised learning; LexRank algorithm; aspect-oriented summaries; cluster sentences; event-aspect LDA model; integer linear programming; multidocument summarization; parser tree; random walk model; sentence compression algorithm; sentence ranking; unsupervised framework; Clustering algorithms; Compression algorithms; Computational linguistics; Computational modeling; Integer linear programming; Pragmatics; Redundancy;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-61284-180-9
Type
conf
DOI
10.1109/FSKD.2011.6019843
Filename
6019843
Link To Document