DocumentCode :
660696
Title :
Noise in Bug Report Data and the Impact on Defect Prediction Results
Author :
Ramler, Rudolf ; Himmelbauer, Johannes
Author_Institution :
Software Competence Center Hagenberg, Hagenberg, Austria
fYear :
2013
fDate :
23-26 Oct. 2013
Firstpage :
173
Lastpage :
180
Abstract :
The potential benefits of defect prediction have created widespread interest in research and generated a considerable number of empirical studies. Applications with real-world data revealed a central problem: Real-world data is "dirty" and often of poor quality. Noise in bug report data is a particular problem for defect prediction since it effects the correct classification of software modules. Is the module actually defective or not? In this paper we examine different causes of noise encountered when predicting defects in an industrial software system and we provide an overview of commonly reported causes in related work. Furthermore we conduct an experiment to explore the impact of class noise on the predictions performance. The experiment shows that the prediction results for the studied system remain reliable even at a noise level of 20% probability of incorrect links between bug reports and modules.
Keywords :
pattern classification; program debugging; software reliability; bug report data; defect prediction; noise level; real-world data; software modules classification; Data models; Databases; Noise; Noise measurement; Predictive models; Software; class noise; data quality; defect prediction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Measurement and the 2013 Eighth International Conference on Software Process and Product Measurement (IWSM-MENSURA), 2013 Joint Conference of the 23rd International Workshop on
Conference_Location :
Ankara
Type :
conf
DOI :
10.1109/IWSM-Mensura.2013.33
Filename :
6693237
Link To Document :
بازگشت