Title :
Graph-based submodular selection for extractive summarization
Author :
Lin, Hui ; Bilmes, Jeff ; Xie, Shasha
Author_Institution :
Dept. of Electr. Eng., Univ. of Washington, Seattle, WA, USA
fDate :
Nov. 13 2009-Dec. 17 2009
Abstract :
We propose a novel approach for unsupervised extractive summarization. Our approach builds a semantic graph for the document to be summarized. Summary extraction is then formulated as optimizing submodular functions defined on the semantic graph. The optimization is theoretically guaranteed to be near-optimal under the framework of submodularity. Extensive experiments on the ICSI meeting summarization task on both human transcripts and automatic speech recognition (ASR) outputs show that the graph-based submodular selection approach consistently outperforms the maximum marginal relevance (MMR) approach, a concept-based approach using integer linear programming (ILP), and a recursive graph-based ranking algorithm using Google´s PageRank.
Keywords :
graph theory; integer programming; linear programming; speech recognition; text analysis; PageRank; automatic speech recognition; document summarization; graph-based submodular selection; human transcript; integer linear programming; optimization; recursive graph-based ranking algorithm; semantic graph; unsupervised extractive summarization; Automatic speech recognition; Computer science; Game theory; Greedy algorithms; Humans; Integer linear programming; Polynomials; Redundancy;
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
DOI :
10.1109/ASRU.2009.5373486