• DocumentCode
    3442292
  • Title
    How Much Do Code Repositories Include Peripheral Modifications?
  • Author
    Kusunoki, Noa ; Hotta, Kazuhiro ; Higo, Y. ; Kusumoto, Shinji
  • Author_Institution
    Grad. Sch. of Inf. Sci. & Technol., Osaka Univ., Suita, Japan
  • Volume
    2
  • fYear
    2013
  • fDate
    2-5 Dec. 2013
  • Firstpage
    19
  • Lastpage
    24
  • Abstract
    In the last decade, a variety of studies on mining software repositories have been conducted. Mining repositories has the potential to yield useful knowledge for future development and maintenance. When software repositories are mined, large commits are often excluded from the mining targets because they include merges and, we believe, peripheral modifications, which may negatively affect the mining of code repositories. However, if large commits also include code modifications, excluding them unintentionally discards those modifications. Moreover, such data cleansing assumes that small commits contain no peripheral modifications. In this paper, we investigate how many peripheral modifications are included in commits in code repositories. As a result, we found that excluding large commits is insufficient to remove hindrances in commits for mining code repositories.
  • Keywords
    data mining; software maintenance; code modifications; code repositories; peripheral modifications; software development; software repositories mining; Data mining; Educational institutions; Maintenance engineering; Merging; Software; Software engineering; White spaces; Large commit; Mining software repositories; Source code analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Software Engineering Conference (APSEC), 2013 20th Asia-Pacific
  • Conference_Location
    Bangkok
  • ISSN
    1530-1362
  • Print_ISBN
    978-1-4799-2143-0
  • Type
    conf
  • DOI
    10.1109/APSEC.2013.106
  • Filename
    6754345