DocumentCode :
479789
Title :
Multi-document Summarization Based on Word Feature Mining
Author :
Wang, Meng ; Wang, Xiaorong ; Li, Chungui ; Zhang, Zengfang
Author_Institution :
Guang Xi Univ. of Technol., Liuzhou
Volume :
1
fYear :
2008
fDate :
12-14 Dec. 2008
Firstpage :
743
Lastpage :
746
Abstract :
This paper discusses an approach to multi-document summarization that builds on understanding word as feature deeply. We created 7 basic word features using the frequency, position information, event information and topic information. Then choose logistic regression model to compute words value. The summarizer gives a score of sentence by words value, and combines score and redundancy of sentence to produce summarization. The evaluation of summaries uses three parameters which are N-gram co-occurrence statistics, term word coverage and high frequency word coverage. The experiment results show the systempsilas has more effectiveness and feasibility.
Keywords :
data mining; document handling; feature extraction; information analysis; regression analysis; N-gram co-occurrence statistics; event information; frequency word coverage; logistic regression model; multidocument summarization; position information; term word coverage; topic information; word feature mining; Aggregates; Computer science; Data mining; Frequency; Helium; Logistics; Paper technology; Position measurement; Software engineering; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Software Engineering, 2008 International Conference on
Conference_Location :
Wuhan, Hubei
Print_ISBN :
978-0-7695-3336-0
Type :
conf
DOI :
10.1109/CSSE.2008.1232
Filename :
4721856
Link To Document :
بازگشت