DocumentCode
635204
Title
It´s not a bug, it´s a feature: How misclassification impacts bug prediction
Author
Herzig, Kim ; Just, Sascha ; Zeller, A.
Author_Institution
Saarland Univ., Saarbrücken, Germany
fYear
2013
fDate
18-26 May 2013
Firstpage
392
Lastpage
401
Abstract
In a manual examination of more than 7,000 issue reports from the bug databases of five open-source projects, we found 33.8% of all bug reports to be misclassified - that is, rather than referring to a code fix, they resulted in a new feature, an update to documentation, or an internal refactoring. This misclassification introduces bias in bug prediction models, confusing bugs and features: On average, 39% of files marked as defective actually never had a bug. We discuss the impact of this misclassification on earlier studies and recommend manual data validation for future studies.
Keywords
data mining; program debugging; software maintenance; bug prediction model; bug reports misclassification; documentation; internal refactoring; Computer bugs; Databases; Documentation; Inspection; Maintenance engineering; Manuals; Noise; Mining software repositories; bias; bug reports; data quality; noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Engineering (ICSE), 2013 35th International Conference on
Conference_Location
San Francisco, CA
Print_ISBN
978-1-4673-3073-2
Type
conf
DOI
10.1109/ICSE.2013.6606585
Filename
6606585
Link To Document