• DocumentCode
    2208664
  • Title

    Software change classification using hunk metrics

  • Author

    Ferzund, Javed ; Ahsan, Syed Nadeem ; Wotawa, Franz

  • Author_Institution
    Inst. for Software Technol., Graz Univ. of Technol., Graz, Austria
  • fYear
    2009
  • fDate
    20-26 Sept. 2009
  • Firstpage
    471
  • Lastpage
    474
  • Abstract
    Change management is a challenging task in software maintenance. Changes are made to the software during its whole life. Some of these changes introduce errors in the code which result in failures. Software changes are composed of small code units called hunks, dispersed in source code files. In this paper we present a technique for classifying software changes based on hunk metrics. We classify individual hunks as buggy or bug-free, thus we provide an approach for bug prediction at the smallest level of granularity. We introduce a set of hunk metrics and build classification models based on these metrics. Classification models are built using logistic regression and random forests. We evaluated the performance of our approach on 7 open source software projects. Our classification approach can classify hunks as buggy or bug free with 81 percent accuracy, 77 percent buggy hunk precision and 67 percent buggy hunk recall on average. Most of the hunk metrics are significant predictors of bugs but the set of significant metrics varies among different projects.
  • Keywords
    management of change; regression analysis; software maintenance; software metrics; source coding; change management; hunk metrics; logistic regression; random forests; software change classification; software maintenance; source code files; Computer bugs; Data mining; Feature extraction; Logistics; Machine learning; Open source software; Predictive models; Software maintenance; Technology management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Maintenance, 2009. ICSM 2009. IEEE International Conference on
  • Conference_Location
    Edmonton, AB
  • ISSN
    1063-6773
  • Print_ISBN
    978-1-4244-4897-5
  • Electronic_ISBN
    1063-6773
  • Type

    conf

  • DOI
    10.1109/ICSM.2009.5306274
  • Filename
    5306274