DocumentCode :
1958917
Title :
A Comparative Study of Supervised Learning Algorithms for Re-opened Bug Prediction
Author :
Xin Xia ; Lo, Daniel ; Xinyu Wang ; Xiaohu Yang ; Shanping Li ; Jianling Sun
Author_Institution :
Coll. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
fYear :
2013
fDate :
5-8 March 2013
Firstpage :
331
Lastpage :
334
Abstract :
Bug fixing is a time-consuming and costly job which is performed in the whole life cycle of software development and maintenance. For many systems, bugs are managed in bug management systems such as Bugzilla. Generally, the status of a typical bug report in Bugzilla changes from new to assigned, verified and closed. However, some bugs have to be reopened. Reopened bugs increase the software development and maintenance cost, increase the workload of bug fixers, and might even delay the future delivery of a software. Only a few studies investigate the phenomenon of reopened bug reports. In this paper, we evaluate the effectiveness of various supervised learning algorithms to predict if a bug report would be reopened. We choose 7 state-of-the-art classical supervised learning algorithm in machine learning literature, i.e., kNN, SVM, Simple Logistic, Bayesian Network, Decision Table, CART and LWL, and 3 ensemble learning algorithms, i.e., AdaBoost, Bagging and Random Forest, and evaluate their performance in predicting reopened bug reports. The experiment results show that among the 10 algorithms, Bagging and Decision Table (IDTM) achieve the best performance. They achieve accuracy scores of 92.91% and 92.80%, respectively, and reopened bug reports F-Measure scores of 0.735 and 0.732, respectively. These results improve the reopened bug reports F-Measure of the state-of-the-art approaches proposed by Shihab et al. by up to 23.53%.
Keywords :
belief networks; decision tables; learning (artificial intelligence); pattern classification; program debugging; software maintenance; support vector machines; trees (mathematics); AdaBoost algorithm; Bayesian network algorithm; Bugzilla; CART algorithm; F-measure; LWL algorithm; SVM algorithm; bagging algorithm; bug fixing; bug management system; classification-and-regression tree; decision table algorithm; k-nearest neighbor; kNN algorithm; random forest algorithm; reopened bug prediction; simple logistic algorithm; software development; software maintenance; supervised learning algorithm; support vector machines; Accuracy; Bagging; Computer bugs; Feature extraction; Prediction algorithms; Software algorithms; Supervised learning; bug reports; classification; comparative study; reopened reports; supervised learning algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Maintenance and Reengineering (CSMR), 2013 17th European Conference on
Conference_Location :
Genova
ISSN :
1534-5351
Print_ISBN :
978-1-4673-5833-0
Type :
conf
DOI :
10.1109/CSMR.2013.43
Filename :
6498482
Link To Document :
بازگشت