Title :
Supervised Lazy Random Walk for Topic-Focused Multi-document Summarization
Author :
Du, Pan ; Guo, Jiafeng ; Cheng, Xueqi
Author_Institution :
Inst. of Comput. Technol., Beijing, China
Abstract :
Topic-focused multi-document summarization aims to produce a summary given a specific topic description and a set of related documents. It has become a crucial text processing task in many real applications that can help users consume the massive information. This paper presents a novel extractive approach based on supervised lazy random walk (Super Lazy). This approach naturally combines the rich features of sentences with the intrinsic sentence graph structure in a principled way, and thus enjoys the advantages of both the existing supervised and unsupervised approaches. Moreover, our approach can achieve the three major goals of topic-focused multi-document summarization (i.e. relevance, salience and diversity) simultaneously with a unified ranking process. Experiments on the benchmark dataset TAC2008 and TAC2009 are performed and the ROUGE evaluation results demonstrate that our approach can significantly outperform both the state-of-the-art supervised and unsupervised methods.
Keywords :
information retrieval; learning (artificial intelligence); random processes; text analysis; ROUGE evaluation; SuperLazy; benchmark dataset TAC2008; benchmark dataset TAC2009; intrinsic sentence graph structure; supervised lazy random walk; text processing task; topic focused multidocument summarization; unified ranking process; Feature extraction; Humans; Lead; Measurement; Support vector machines; Training; Vectors; diversity; relevance; salience; supervised lazy random walk; topic-focused multi-document summarization;
Conference_Titel :
Data Mining (ICDM), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver,BC
Print_ISBN :
978-1-4577-2075-8
DOI :
10.1109/ICDM.2011.140