DocumentCode :
2680483
Title :
Prediction of Developer Participation in Issues of Open Source Projects
Author :
Schwerz, André Luis ; Liberato, Rafael ; Wiese, Igor Scaliante ; Steinmacher, Igor ; Gerosa, Marco Aurélio ; Ferreira, João Eduardo
fYear :
2012
fDate :
15-18 Oct. 2012
Firstpage :
109
Lastpage :
114
Abstract :
Developers of distributed open source projects use management and issues tracking tool to communicate. These tools provide a large volume of unstructured information that makes the triage of issues difficult, increasing developers´ overhead. This problem is common to online communities based on volunteer participation. This paper shows the importance of the content of comments in an open source project to build a classifier to predict the participation for a developer in an issue. To design this prediction model, we used two machine learning algorithms called Naive Bayes and J48. We used the data of three Apache Hadoop subprojects to evaluate the use of the algorithms. By applying our approach to the most active developers of these subprojects we have achieved an accuracy ranging from 79% to 96%. The results indicate that the content of comments in issues of open source projects is a relevant factor to build a classifier of issues for developers.
Keywords :
learning (artificial intelligence); public domain software; Apache Hadoop subprojects; developer participation; distributed open source projects; machine learning algorithm; naive Bayes; prediction model; tracking tool; unstructured information; volunteer participation; Argon; Bayesian methods; Electronic mail; Machine learning algorithms; Predictive models; Random access memory; Software; Content analysis; issue tracking classifier; machine learning; prediction model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Collaborative Systems (SBSC), 2012 Brazilian Symposium on
Conference_Location :
Sao Paulo
Print_ISBN :
978-1-4673-4696-2
Type :
conf
DOI :
10.1109/SBSC.2012.27
Filename :
6391736
Link To Document :
بازگشت