• DocumentCode
    3579811
  • Title

    A New Approach for Multi-document Summarization Based on Latent Semantic Analysis

  • Author

    Shuchu Xiong ; Yihui Luo

  • Author_Institution
    Coll. of Comput. & Inf., Hunan Univ. of Commerce, Changsha, China
  • Volume
    1
  • fYear
    2014
  • Firstpage
    177
  • Lastpage
    180
  • Abstract
    Multi-document summary plays an increasingly important role with the exponential document growth on the web. Among many traditional multi-document summarization techniques, the latent semantic analysis (LSA) is a unique duo to its using latent semantic information instead of original feature, which results in a better performance. However, since those approaches based on LSA evaluate and select sentence individually, none of them is able to remove the redundant sentences. In this paper, we propose a new method to evaluate a sentence subset based on its capacity to reproduce term projections on right singular vectors. Finally, the experiments on DUC2002 and DUC2004 datasets validate the effectiveness of our proposed methods.
  • Keywords
    Internet; document handling; indexing; DUC2002 datasets; DUC2004 datasets; LSA; latent semantic analysis; latent semantic information; multidocument summarization; redundant sentence removal; sentence selection; sentence subset evaluation; Algorithm design and analysis; Cost function; Data mining; Heuristic algorithms; Semantics; Vectors; forward selection; latent semantic analysis; multi-document summrization; singular value decomposition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Design (ISCID), 2014 Seventh International Symposium on
  • Print_ISBN
    978-1-4799-7004-9
  • Type

    conf

  • DOI
    10.1109/ISCID.2014.27
  • Filename
    7064167