DocumentCode :
3406700
Title :
The impact of tangled code changes
Author :
Herzig, Kim ; Zeller, A.
Author_Institution :
Microsoft Res., Cambridge, UK
fYear :
2013
fDate :
18-19 May 2013
Firstpage :
121
Lastpage :
130
Abstract :
When interacting with version control systems, developers often commit unrelated or loosely related code changes in a single transaction. When analyzing the version history, such tangled changes will make all changes to all modules appear related, possibly compromising the resulting analyses through noise and bias. In an investigation of five open-source Java projects, we found up to 15% of all bug fixes to consist of multiple tangled changes. Using a multi-predictor approach to untangle changes, we show that on average at least 16.6% of all source files are incorrectly associated with bug reports. We recommend better change organization to limit the impact of tangled changes.
Keywords :
Java; configuration management; program debugging; public domain software; bug fixes; bug reports; change organization; multiple tangled changes; multipredictor approach; open-source JAVA projects; source files; version control systems; Accuracy; History; Manuals; Noise; Open source software; Partitioning algorithms; Mining software repositories; bias; data quality; noise; tangled code changes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Mining Software Repositories (MSR), 2013 10th IEEE Working Conference on
Conference_Location :
San Francisco, CA
ISSN :
2160-1852
Print_ISBN :
978-1-4799-0345-0
Type :
conf
DOI :
10.1109/MSR.2013.6624018
Filename :
6624018
Link To Document :
بازگشت