Title :
A comparative analysis of the efficiency of change metrics and static code attributes for defect prediction
Author :
Moser, Raimund ; Pedrycz, Witold ; Succi, Giancarlo
Author_Institution :
Free Univ. of Bolzano-Bozen, Bolzano
Abstract :
In this paper we present a comparative analysis of the predictive power of two different sets of metrics for defect prediction. We choose one set of product related and one set of process related software metrics and use them for classifying Java files of the Eclipse project as defective respective defect-free. Classification models are built using three common machine learners: logistic regression, naive Bayes, and decision trees. To allow different costs for prediction errors we perform cost-sensitive classification, which proves to be very successful: >75% percentage of correctly classified files, a recall of >80%, and a false positive rate <30%. Results indicate that for the Eclipse data, process metrics are more efficient defect predictors than code metrics.
Keywords :
Bayes methods; Java; decision trees; learning (artificial intelligence); pattern classification; program diagnostics; regression analysis; software metrics; software quality; Eclipse project; Java file classification; change metrics; comparative analysis; cost-sensitive classification; decision tree; defect prediction; logistic regression; machine learning; naive Bayes method; process related software metrics; product related software metrics; software quality; static code attribute; Classification tree analysis; Costs; Java; Logistics; Permission; Predictive models; Resource management; Software engineering; Software metrics; Testing; cost-sensitive classification; defect prediction; software metrics;
Conference_Titel :
Software Engineering, 2008. ICSE '08. ACM/IEEE 30th International Conference on
Conference_Location :
Leipzig
Print_ISBN :
978-1-4244-4486-1
Electronic_ISBN :
0270-5257
DOI :
10.1145/1368088.1368114