• DocumentCode
    551920
  • Title

    Weighted graph model based sentence clustering and ranking for document summarization

  • Author

    Ge, Shuzhi Sam ; Zhang, Zhengchen ; He, Hongsheng

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Nat. Univ. of Singapore, Singapore, Singapore
  • fYear
    2011
  • fDate
    16-18 Aug. 2011
  • Firstpage
    90
  • Lastpage
    95
  • Abstract
    This paper proposes a sentence ranking and clustering based summarization method that extracts essential sentences from a document. To discover central sentences, a weighted undirected graph that takes sentence similarities and the discourse relationship between sentences as the weights of edges is constructed for the given document. A graph-ranking algorithm is implemented to calculate the scores of sentences. We also build a matrix for the document, and an algorithm based on Sparse Non-negative Matrix Factorization is introduced to cluster the sentences in the document. High ranked sentences of each cluster are selected to comprise the summarization of the document. The experimental results on the Document Understanding Conference (DUC) 2001 data set demonstrate the effectiveness of the document summarization algorithm.
  • Keywords
    document handling; graph theory; matrix decomposition; clustering based summarization method; document summarization; document understanding conference; essential sentence extraction; graph-ranking algorithm; sentence clustering; sentence ranking; sparse nonnegative matrix factorization; weighted graph model; weighted undirected graph; Clustering algorithms; Connectors; Feature extraction; Semantics; Sparse matrices; Strontium; Symmetric matrices;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Interaction Sciences (ICIS), 2011 4th International Conference on
  • Conference_Location
    Busan
  • Print_ISBN
    978-1-4577-0480-2
  • Electronic_ISBN
    978-89-88678-45-9
  • Type

    conf

  • Filename
    6014538