• DocumentCode
    2346013
  • Title

    When process data quality affects the number of bugs: Correlations in software engineering datasets

  • Author

    Bachmann, Adrian ; Bernstein, Abraham

  • Author_Institution
    Dept. of Inf., Univ. of Zurich, Zurich, Switzerland
  • fYear
    2010
  • fDate
    2-3 May 2010
  • Firstpage
    62
  • Lastpage
    71
  • Abstract
    Software engineering process information extracted from version control systems and bug tracking databases are widely used in empirical software engineering. In prior work, we showed that these data are plagued by quality deficiencies, which vary in its characteristics across projects. In addition, we showed that those deficiencies in the form of bias do impact the results of studies in empirical software engineering. While these findings affect software engineering researchers the impact on practitioners has not yet been substantiated. In this paper we, therefore, explore (i) if the process data quality and characteristics have an influence on the bug fixing process and (ii) if the process quality as measured by the process data has an influence on the product (i.e., software) quality. Specifically, we analyze six Open Source as well as two Closed Source projects and show that process data quality and characteristics have an impact on the bug fixing process: the high rate of empty commit messages in Eclipse, for example, correlates with the bug report quality. We also show that the product quality - measured by number of bugs reported - is affected by process data quality measures. These findings have the potential to prompt practitioners to increase the quality of their software process and its associated data quality.
  • Keywords
    configuration management; program debugging; public domain software; software quality; Eclipse; bug fixing process; bug report quality; bug tracking databases; closed source projects; correlation; information extraction; open source projects; process data quality; software engineering datasets; version control systems; Cascading style sheets; Computer bugs; Data engineering; Informatics; Open source software; Prediction algorithms; Software engineering; Software measurement; Software quality; Software testing; case study; correlation; mining software repositories; process quality; product quality;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Mining Software Repositories (MSR), 2010 7th IEEE Working Conference on
  • Conference_Location
    Cape Town
  • Print_ISBN
    978-1-4244-6802-7
  • Electronic_ISBN
    978-1-4244-6803-4
  • Type

    conf

  • DOI
    10.1109/MSR.2010.5463286
  • Filename
    5463286