DocumentCode
551920
Title
Weighted graph model based sentence clustering and ranking for document summarization
Author
Ge, Shuzhi Sam ; Zhang, Zhengchen ; He, Hongsheng
Author_Institution
Dept. of Electr. & Comput. Eng., Nat. Univ. of Singapore, Singapore, Singapore
fYear
2011
fDate
16-18 Aug. 2011
Firstpage
90
Lastpage
95
Abstract
This paper proposes a sentence ranking and clustering based summarization method that extracts essential sentences from a document. To discover central sentences, a weighted undirected graph that takes sentence similarities and the discourse relationship between sentences as the weights of edges is constructed for the given document. A graph-ranking algorithm is implemented to calculate the scores of sentences. We also build a matrix for the document, and an algorithm based on Sparse Non-negative Matrix Factorization is introduced to cluster the sentences in the document. High ranked sentences of each cluster are selected to comprise the summarization of the document. The experimental results on the Document Understanding Conference (DUC) 2001 data set demonstrate the effectiveness of the document summarization algorithm.
Keywords
document handling; graph theory; matrix decomposition; clustering based summarization method; document summarization; document understanding conference; essential sentence extraction; graph-ranking algorithm; sentence clustering; sentence ranking; sparse nonnegative matrix factorization; weighted graph model; weighted undirected graph; Clustering algorithms; Connectors; Feature extraction; Semantics; Sparse matrices; Strontium; Symmetric matrices;
fLanguage
English
Publisher
ieee
Conference_Titel
Interaction Sciences (ICIS), 2011 4th International Conference on
Conference_Location
Busan
Print_ISBN
978-1-4577-0480-2
Electronic_ISBN
978-89-88678-45-9
Type
conf
Filename
6014538
Link To Document