  • DocumentCode
    2797007
  • Title
    Identifying Changed Source Code Lines from Version Repositories
  • Author
    Canfora, Gerardo ; Cerulo, Luigi ; Di Penta, Massimiliano
  • Author_Institution
    Univ. of Sannio, Benevento
  • fYear
    2007
  • fDate
    20-26 May 2007
  • Firstpage
    14
  • Lastpage
    14
  • Abstract
    Observing the evolution of software systems at different levels of granularity has been a key issue for a number of studies, aiming at predicting defects or at studying certain phenomena, such as the presence of clones or of crosscutting concerns. Versioning systems such as CVS and SVN, however, only provide information about lines added or deleted by a contributor: any change is shown as a sequence of additions and deletions. This provides an erroneous estimate of the amount of code changed. This paper shows how the evolution of changes at source code line level can be inferred from CVS repositories, by combining information retrieval techniques and the Levenshtein edit distance. The application of the proposed approach to the ArgoUML case study indicates a high precision and recall.
  • Keywords
    configuration management; information retrieval; software prototyping; software reusability; ArgoUML snapshot; concurrent versioning system; crosscutting concern; software evolution system; source code lines; Cloning; Collaborative software; Collaborative work; Information analysis; Information retrieval; Lab-on-a-chip; Software systems
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Mining Software Repositories, 2007. ICSE Workshops MSR '07. Fourth International Workshop on
  • Conference_Location
    Minneapolis, MN
  • Print_ISBN
    0-7695-2950-X
  • Type
    conf
  • DOI
    10.1109/MSR.2007.14
  • Filename
    4228651
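
The abstract describes inferring line-level changes by combining information retrieval techniques with the Levenshtein edit distance. The record does not give the algorithm itself, so the following is only a minimal sketch of the edit-distance half of that idea, under stated assumptions: the function names, the greedy best-match pairing, and the 0.6 similarity threshold are all hypothetical and are not taken from the paper. Given the lines a diff reports as deleted and added within one hunk, it pairs lines that are similar enough to count as "changed" and leaves the rest as true deletions and additions.

```python
from typing import List, Tuple


def levenshtein(a: str, b: str) -> int:
    """Edit distance counting single-character inserts, deletes, substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # delete from a
                               current[j - 1] + 1,       # insert into a
                               previous[j - 1] + cost))  # substitute or match
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Normalized similarity in [0, 1]; 1.0 means identical strings."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a, b) / longest


def classify_hunk(
    deleted: List[str],
    added: List[str],
    threshold: float = 0.6,  # hypothetical cut-off, not a value from the paper
) -> Tuple[List[Tuple[str, str]], List[str], List[str]]:
    """Greedily pair each deleted line with its most similar unmatched added line.

    Returns (changed pairs, lines still counted as deleted, lines still
    counted as added). Greedy pairing is a simplification chosen for brevity.
    """
    changed: List[Tuple[str, str]] = []
    still_deleted: List[str] = []
    unmatched = list(range(len(added)))
    for old in deleted:
        best_idx, best_sim = None, threshold
        for idx in unmatched:
            sim = similarity(old.strip(), added[idx].strip())
            if sim >= best_sim:
                best_idx, best_sim = idx, sim
        if best_idx is None:
            still_deleted.append(old)
        else:
            changed.append((old, added[best_idx]))
            unmatched.remove(best_idx)
    still_added = [added[i] for i in unmatched]
    return changed, still_deleted, still_added


if __name__ == "__main__":
    result = classify_hunk(
        ["int count = 0;", "log.debug(x);"],
        ["int counter = 0;", "return result;"],
    )
    print(result)
```

In this sketch a renamed variable is reported as one changed line rather than one deletion plus one addition, which is the kind of over-counting the abstract attributes to plain CVS/SVN diffs. A fuller treatment would also need the information retrieval step the abstract mentions, which is omitted here.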