• DocumentCode
    1909758
  • Title

    Multi-Document Summarization as Applied in Information Retrieval

  • Author

    Zhou, Dan ; Li, Lei

  • Author_Institution
    Center for Intelligence Sci. & Technol. Res., Beijing Univ. of Posts & Telecommun., Beijing
  • fYear
    2007
  • fDate
    Aug. 30 2007-Sept. 1 2007
  • Firstpage
    203
  • Lastpage
    208
  • Abstract
    In this paper we presented the use of multi-document summarization as postprocessing step in information retrieval (IR). We examined the differences between requirements for general multi-document summarization and requirements when it is applied for IR, and highlighted the requirements for clustering and context information extraction, which is much helpful to the users for browsing and searching relative results. To generate this type of summary, we first cluster the retrieved documents by their topics using a repeated bisection algorithm, and extract the centroid words for each cluster. The final summary is generated on the base of the query words and the cluster centroids, containing query-centered information as well as context information.
  • Keywords
    abstracting; document handling; pattern clustering; query processing; centroid words; clustering requirements; context information extraction; information retrieval; multidocument summarization; query words; repeated bisection algorithm; Clustering algorithms; Data mining; Explosions; Guidelines; Information filtering; Information filters; Information retrieval;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2007. NLP-KE 2007. International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-1610-3
  • Electronic_ISBN
    978-1-4244-1611-0
  • Type

    conf

  • DOI
    10.1109/NLPKE.2007.4368034
  • Filename
    4368034