DocumentCode
3579811
Title
A New Approach for Multi-document Summarization Based on Latent Semantic Analysis
Author
Shuchu Xiong ; Yihui Luo
Author_Institution
Coll. of Comput. & Inf., Hunan Univ. of Commerce, Changsha, China
Volume
1
fYear
2014
Firstpage
177
Lastpage
180
Abstract
Multi-document summary plays an increasingly important role with the exponential document growth on the web. Among many traditional multi-document summarization techniques, the latent semantic analysis (LSA) is a unique duo to its using latent semantic information instead of original feature, which results in a better performance. However, since those approaches based on LSA evaluate and select sentence individually, none of them is able to remove the redundant sentences. In this paper, we propose a new method to evaluate a sentence subset based on its capacity to reproduce term projections on right singular vectors. Finally, the experiments on DUC2002 and DUC2004 datasets validate the effectiveness of our proposed methods.
Keywords
Internet; document handling; indexing; DUC2002 datasets; DUC2004 datasets; LSA; latent semantic analysis; latent semantic information; multidocument summarization; redundant sentence removal; sentence selection; sentence subset evaluation; Algorithm design and analysis; Cost function; Data mining; Heuristic algorithms; Semantics; Vectors; forward selection; latent semantic analysis; multi-document summrization; singular value decomposition;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Design (ISCID), 2014 Seventh International Symposium on
Print_ISBN
978-1-4799-7004-9
Type
conf
DOI
10.1109/ISCID.2014.27
Filename
7064167
Link To Document