DocumentCode
3442292
Title
How Much Do Code Repositories Include Peripheral Modifications?
Author
Kusunoki, Noa ; Hotta, Kazuhiro ; Higo, Y. ; Kusumoto, Shinji
Author_Institution
Grad. Sch. of Inf. Sci. & Technol., Osaka Univ., Suita, Japan
Volume
2
fYear
2013
fDate
2-5 Dec. 2013
Firstpage
19
Lastpage
24
Abstract
In the last decade, a variety of studies on mining software repositories have been conducted. Mining repositories has the potential to yield useful knowledge for future development and maintenance. When software repositories are mined, large commits are often excluded from the mining targets because they include merges and because we believe they include peripheral modifications, which may negatively affect the mining of code repositories. However, if large commits include code modifications, excluding them unintentionally loses those modifications. Moreover, such data cleansing assumes that small commits contain no peripheral modifications. In this paper, we investigate how many peripheral modifications are included in the commits of code repositories. As a result, we found that excluding large commits is insufficient to remove hindrances in commits for mining code repositories.
Keywords
data mining; software maintenance; code modifications; code repositories; peripheral modifications; software development; software repositories mining; Data mining; Educational institutions; Maintenance engineering; Merging; Software; Software engineering; White spaces; Large commit; Mining software repositories; Source code analysis
fLanguage
English
Publisher
IEEE
Conference_Titel
Software Engineering Conference (APSEC), 2013 20th Asia-Pacific
Conference_Location
Bangkok
ISSN
1530-1362
Print_ISBN
978-1-4799-2143-0
Type
conf
DOI
10.1109/APSEC.2013.106
Filename
6754345