Title :
Predicting Wikipedia Editor´s Editing Interest Based on Factor Graph Model
Author :
Haisu Zhang ; Sheng Zhang ; Zhaolin Wu ; Liwei Huang ; Yutao Ma
Author_Institution :
1st Dept., Acad. of Nat. Defense Inf., Wuhan, China
fDate :
June 27 2014-July 2 2014
Abstract :
Recruiting or recommending appropriate latent editors who can edit a specific entry (or called article) plays an important role in improving the quality of Wikipedia entries. To predict an editor´s editing interest for Wikipedia entries, this paper proposes an Interest Prediction Factor Graph (IPFG) model, which is characterized by editor´s social properties, hyperlinks between Wikipedia entries, categories of an entry and other important features. Furthermore, the paper suggests a parameter learning algorithm based on the gradient descent and Loopy Sum-Product algorithms for factor graphs. The experiment on a Wikipedia dataset shows that, the average prediction accuracy (F1-Measure) of the IPFG model could be up to 87.5%, which is about 35% higher than that of a collaborative filtering approach. Moreover, the paper analyses how incomplete social properties and editing bursts affect the prediction accuracy of the IPFG model. What we found would provide a useful insight into effective Wikipedia article tossing, and improve the quality of those entries that belong to specific categories by means of collective collaboration.
Keywords :
Web sites; data mining; learning (artificial intelligence); F1-measure; IPFG model; Wikipedia dataset; Wikipedia editor editing interest prediction; Wikipedia entries; Wikipedia entry quality improvement; article tossing; average prediction accuracy; collective collaboration; editor hyperlinks; editor social properties; entry categories; entry quality improvement; gradient descent algorithm; incomplete social properties; interest prediction factor graph model; latent editor recommendation; latent editor recruitment; loopy sum-product algorithm; parameter learning algorithm; Accuracy; Electronic publishing; Encyclopedias; Internet; Prediction algorithms; Predictive models; Factor Graph; Interest Prediction; Probabilistic Graphical Model; Social Network Mining; Wikipedia;
Conference_Titel :
Big Data (BigData Congress), 2014 IEEE International Congress on
Conference_Location :
Anchorage, AK
Print_ISBN :
978-1-4799-5056-0
DOI :
10.1109/BigData.Congress.2014.63