DocumentCode :
3579811
Title :
A New Approach for Multi-document Summarization Based on Latent Semantic Analysis
Author :
Shuchu Xiong ; Yihui Luo
Author_Institution :
Coll. of Comput. & Inf., Hunan Univ. of Commerce, Changsha, China
Volume :
1
fYear :
2014
Firstpage :
177
Lastpage :
180
Abstract :
Multi-document summary plays an increasingly important role with the exponential document growth on the web. Among many traditional multi-document summarization techniques, the latent semantic analysis (LSA) is a unique duo to its using latent semantic information instead of original feature, which results in a better performance. However, since those approaches based on LSA evaluate and select sentence individually, none of them is able to remove the redundant sentences. In this paper, we propose a new method to evaluate a sentence subset based on its capacity to reproduce term projections on right singular vectors. Finally, the experiments on DUC2002 and DUC2004 datasets validate the effectiveness of our proposed methods.
Keywords :
Internet; document handling; indexing; DUC2002 datasets; DUC2004 datasets; LSA; latent semantic analysis; latent semantic information; multidocument summarization; redundant sentence removal; sentence selection; sentence subset evaluation; Algorithm design and analysis; Cost function; Data mining; Heuristic algorithms; Semantics; Vectors; forward selection; latent semantic analysis; multi-document summrization; singular value decomposition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Design (ISCID), 2014 Seventh International Symposium on
Print_ISBN :
978-1-4799-7004-9
Type :
conf
DOI :
10.1109/ISCID.2014.27
Filename :
7064167
Link To Document :
بازگشت