DocumentCode :
2223580
Title :
Personalized PageRank Based Multi-document Summarization
Author :
Liu, Yong ; Wang, Xiaolei ; Zhang, Jin ; Xu, Hongbo
Author_Institution :
Sch. of Mech. Eng. & Autom., Northeastern Univ., Shenyang
fYear :
2008
fDate :
14-15 July 2008
Firstpage :
169
Lastpage :
173
Abstract :
This paper presents a novel multi-document summarization approach based on personalized pagerank (PPRSum). In this algorithm, we uniformly integrate various kinds of information in the corpus. At first, we train a salience model of sentence global features based on Naive Bayes Model. Secondly, we generate a relevance model for each corpus utilizing the query of it. Then, we compute the personalized prior probability for each sentence in the corpus utilizing the salience model and the relevance model both. With the help of personalized prior probability, a Personalized PageRank ranking process is performed depending on the relationships among all sentences in the corpus. Additionally, the redundancy penalty is imposed on each sentence. The summary is produced by choosing the sentences with both high query-focused information richness and high information novelty. Experiments on DUC2007 are performed and the ROUGE evaluation results show that PPRSum ranks between the 1st and the 2nd systems on DUC2007 main task.
Keywords :
Bayes methods; probability; query processing; search engines; text analysis; multidocument summarization; naive Bayes model; personalized pagerank; personalized prior probability; query processing; redundancy penalty; relevance model; salience model; text analysis; Automation; Clustering algorithms; Computers; Conferences; Data mining; Frequency; Mechanical engineering; Partitioning algorithms; Performance evaluation; Naïve Bayes model; Personalized PageRank; personalized prior probability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Semantic Computing and Systems, 2008. WSCS '08. IEEE International Workshop on
Conference_Location :
Huangshan
Print_ISBN :
978-0-7695-3316-2
Electronic_ISBN :
978-0-7695-3316-2
Type :
conf
DOI :
10.1109/WSCS.2008.32
Filename :
4570834
Link To Document :
بازگشت