• DocumentCode
    479789
  • Title

    Multi-document Summarization Based on Word Feature Mining

  • Author

    Wang, Meng ; Wang, Xiaorong ; Li, Chungui ; Zhang, Zengfang

  • Author_Institution
    Guang Xi Univ. of Technol., Liuzhou
  • Volume
    1
  • fYear
    2008
  • fDate
    12-14 Dec. 2008
  • Firstpage
    743
  • Lastpage
    746
  • Abstract
    This paper discusses an approach to multi-document summarization that builds on understanding word as feature deeply. We created 7 basic word features using the frequency, position information, event information and topic information. Then choose logistic regression model to compute words value. The summarizer gives a score of sentence by words value, and combines score and redundancy of sentence to produce summarization. The evaluation of summaries uses three parameters which are N-gram co-occurrence statistics, term word coverage and high frequency word coverage. The experiment results show the systempsilas has more effectiveness and feasibility.
  • Keywords
    data mining; document handling; feature extraction; information analysis; regression analysis; N-gram co-occurrence statistics; event information; frequency word coverage; logistic regression model; multidocument summarization; position information; term word coverage; topic information; word feature mining; Aggregates; Computer science; Data mining; Frequency; Helium; Logistics; Paper technology; Position measurement; Software engineering; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Software Engineering, 2008 International Conference on
  • Conference_Location
    Wuhan, Hubei
  • Print_ISBN
    978-0-7695-3336-0
  • Type

    conf

  • DOI
    10.1109/CSSE.2008.1232
  • Filename
    4721856