DocumentCode
1909758
Title
Multi-Document Summarization as Applied in Information Retrieval
Author
Zhou, Dan ; Li, Lei
Author_Institution
Center for Intelligence Sci. & Technol. Res., Beijing Univ. of Posts & Telecommun., Beijing
fYear
2007
fDate
Aug. 30 2007-Sept. 1 2007
Firstpage
203
Lastpage
208
Abstract
In this paper we presented the use of multi-document summarization as postprocessing step in information retrieval (IR). We examined the differences between requirements for general multi-document summarization and requirements when it is applied for IR, and highlighted the requirements for clustering and context information extraction, which is much helpful to the users for browsing and searching relative results. To generate this type of summary, we first cluster the retrieved documents by their topics using a repeated bisection algorithm, and extract the centroid words for each cluster. The final summary is generated on the base of the query words and the cluster centroids, containing query-centered information as well as context information.
Keywords
abstracting; document handling; pattern clustering; query processing; centroid words; clustering requirements; context information extraction; information retrieval; multidocument summarization; query words; repeated bisection algorithm; Clustering algorithms; Data mining; Explosions; Guidelines; Information filtering; Information filters; Information retrieval;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2007. NLP-KE 2007. International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-1610-3
Electronic_ISBN
978-1-4244-1611-0
Type
conf
DOI
10.1109/NLPKE.2007.4368034
Filename
4368034
Link To Document