Title :
Non-essential changes in version histories
Author :
Kawrykow, David ; Robillard, Martin P.
Author_Institution :
McGill Univ., Montréal, QC, Canada
Abstract :
Numerous techniques involve mining change data captured in software archives to assist engineering efforts, for example to identify components that tend to evolve together. We observed that important changes to software artifacts are sometimes accompanied by numerous non-essential modifications, such as local variable refactorings, or textual differences induced as part of a rename refactoring. We developed a tool-supported technique for detecting non-essential code differences in the revision histories of software systems. We used our technique to investigate code changes in over 24,000 change sets gathered from the change histories of seven long-lived open-source systems. We found that up to 15.5% of a system´s method updates were due solely to non-essential differences. We also report on numerous observations on the distribution of non-essential differences in change history and their potential impact on change-based analyses.
Keywords :
data mining; public domain software; software maintenance; software tools; change based analyses; change data mining; local variable refactorings; long lived open source systems; nonessential code difference detection; nonessential version history changes; rename refactoring; software archives; software artifacts; tool supported technique; Catalogs; Data mining; History; Java; Open source software; Programming; differencing algorithms; mining software repositories; software change analysis;
Conference_Titel :
Software Engineering (ICSE), 2011 33rd International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
978-1-4503-0445-0
Electronic_ISBN :
0270-5257
DOI :
10.1145/1985793.1985842