DocumentCode :
3452614
Title :
Characterization and Prediction of Issue-Related Risks in Software Projects
Author :
Choetkiertikul, Morakot ; Hoa Khanh Dam ; Tran, Truyen ; Ghose, Aditya
Author_Institution :
Sch. of Comput. Sci. & Software Eng., Univ. of Wollongong, Wollongong, NSW, Australia
fYear :
2015
fDate :
16-17 May 2015
Firstpage :
280
Lastpage :
291
Abstract :
Identifying risks relevant to a software project and planning measures to deal with them are critical to the success of the project. Current practices in risk assessment mostly rely on high-level, generic guidance or the subjective judgements of experts. In this paper, we propose a novel approach to risk assessment using historical data associated with a software project. Specifically, our approach identifies patterns of past events that caused project delays, and uses this knowledge to identify risks in the current state of the project. A set of risk factors characterizing “risky” software tasks (in the form of issues) were extracted from five open source projects: Apache, Duraspace, JBoss, Moodle, and Spring. In addition, we performed feature selection using a sparse logistic regression model to select risk factors with good discriminative power. Based on these risk factors, we built predictive models to predict if an issue will cause a project delay. Our predictive models are able to predict both the risk impact (i.e. the extend of the delay) and the likelihood of a risk occurring. The evaluation results demonstrate the effectiveness of our predictive models, achieving on average 48%-81% precision, 23%-90% recall, 29%-71% F-measure, and 70%-92% Area Under the ROC Curve. Our predictive models also have low error rates: 0.39-0.75 for Macro-averaged Mean Cost-Error and 0.7-1.2 for Macro-averaged Mean Absolute Error.
Keywords :
feature selection; project management; regression analysis; risk management; software development management; Apache; Duraspace; JBoss; Moodle; Spring; feature selection; issue-related risks; macroaveraged mean absolute error; macroaveraged mean cost-error; open source projects; project delay; project delays; risk assessment; risk factor selection; risks identification; risky software tasks; software projects; sparse logistic regression model; Delays; Feature extraction; Logistics; Predictive models; Risk management; Software; Springs;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Mining Software Repositories (MSR), 2015 IEEE/ACM 12th Working Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/MSR.2015.33
Filename :
7180087
Link To Document :
بازگشت