DocumentCode :
1606448
Title :
Multi-document summarization using sentence clustering
Author :
Gupta, V.K. ; Siddiqui, Tanveer J.
Author_Institution :
Samsung India Software Oper., Bangalore, India
fYear :
2012
Firstpage :
1
Lastpage :
5
Abstract :
This paper presents an approach to query focused multi document summarization by combining single document summary using sentence clustering. Both syntactic and semantic similarity between sentences is used for clustering. Single document summary is generated using document feature, sentence reference index feature, location feature and concept similarity feature. Sentences from single document summaries are clustered and top most sentences from each cluster are used for creating multi-document summary. We observed an average F-measure of 0.33774 on DUC 2002 multi-document dataset, which is comparable to three best performing systems reported on the same dataset.
Keywords :
document handling; grammars; pattern clustering; pattern matching; F-measure; concept similarity feature; document feature; location feature; multidocument dataset; multidocument summarization; semantic similarity; sentence clustering; sentence reference index feature; single document summary; syntactic similarity; Cancer; Feature extraction; Indexes; Information retrieval; Semantics; Syntactics; Vectors; DUC-2002; Multi document summarization; feature extraction; sentence clustering method;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Human Computer Interaction (IHCI), 2012 4th International Conference on
Conference_Location :
Kharagpur
Print_ISBN :
978-1-4673-4367-1
Type :
conf
DOI :
10.1109/IHCI.2012.6481826
Filename :
6481826
Link To Document :
بازگشت