DocumentCode :
3009197
Title :
Mining software repositories for traceability links
Author :
Kagdi, Huzefa ; Maletic, Jonathan I. ; Sharif, Bonita
Author_Institution :
Dept. of Comput. Sci., Kent State Univ., Kent, OH
fYear :
2007
fDate :
26-29 June 2007
Firstpage :
145
Lastpage :
154
Abstract :
An approach to recover/discover traceability links between software artifacts via the examination of a software system´s version history is presented. A heuristic-based approach that uses sequential-pattern mining is applied to the commits in software repositories for uncovering highly frequent co-changing sets of artifacts (e.g., source code and documentation). If different types of files are committed together with high frequency then there is a high probability that they have a traceability link between them. The approach is evaluated on a number of versions of the open source system KDE. As a validation step, the discovered links are used to predict similar changes in the newer versions of the same system. The results show highly precision predictions of certain types of traceability links.
Keywords :
data mining; software engineering; sequential pattern mining; software artifacts; software repositories; software system; traceability links; Data mining; Documentation; Frequency; History; Itemsets; Open source software; Software maintenance; Software systems; Software tools;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Program Comprehension, 2007. ICPC '07. 15th IEEE International Conference on
Conference_Location :
Banff, Alberta, BC
ISSN :
1092-8138
Print_ISBN :
0-7695-2860-0
Type :
conf
DOI :
10.1109/ICPC.2007.28
Filename :
4268249
Link To Document :
بازگشت