• DocumentCode
    635204
  • Title

    It´s not a bug, it´s a feature: How misclassification impacts bug prediction

  • Author

    Herzig, Kim ; Just, Sascha ; Zeller, A.

  • Author_Institution
    Saarland Univ., Saarbrücken, Germany
  • fYear
    2013
  • fDate
    18-26 May 2013
  • Firstpage
    392
  • Lastpage
    401
  • Abstract
    In a manual examination of more than 7,000 issue reports from the bug databases of five open-source projects, we found 33.8% of all bug reports to be misclassified - that is, rather than referring to a code fix, they resulted in a new feature, an update to documentation, or an internal refactoring. This misclassification introduces bias in bug prediction models, confusing bugs and features: On average, 39% of files marked as defective actually never had a bug. We discuss the impact of this misclassification on earlier studies and recommend manual data validation for future studies.
  • Keywords
    data mining; program debugging; software maintenance; bug prediction model; bug reports misclassification; documentation; internal refactoring; Computer bugs; Databases; Documentation; Inspection; Maintenance engineering; Manuals; Noise; Mining software repositories; bias; bug reports; data quality; noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering (ICSE), 2013 35th International Conference on
  • Conference_Location
    San Francisco, CA
  • Print_ISBN
    978-1-4673-3073-2
  • Type

    conf

  • DOI
    10.1109/ICSE.2013.6606585
  • Filename
    6606585