DocumentCode :
3197574
Title :
Understanding the rationale for updating a function’s comment
Author :
Malik, Haroon ; Chowdhury, Istehad ; Tsou, Hsiao-Ming ; Jiang, Zhen Ming ; Hassan, Ahmed E.
Author_Institution :
Sch. of Comput., Queen's Univ., Kingston, ON
fYear :
2008
fDate :
Sept. 28 2008-Oct. 4 2008
Firstpage :
167
Lastpage :
176
Abstract :
Up-to-date comments are critical for the successful evolution of a software application. When modifying a function, developers may or may not update the comment associated with it. For example, comments associated with a complex function are likely to be updated more often when the function is modified, to prevent the code and the comments from drifting apart. Nevertheless, the rationale behind updating a comment has never been studied. In this paper, we present a large empirical study to better understand the rationale for updating comments. We recover the code change history for four large open source projects (GCC: a compiler, FreeBSD: an operating system, PostgreSQL: a database management system, and GCluster: a clustering framework) with an average code history of 10 years. Using the Random Forests algorithm, we investigate the rationale for updating comments along three dimensions: characteristics of the changed function, characteristics of the change itself, and time and code ownership characteristics. Our case study shows that we can predict, with an accuracy of 80%, the likelihood of updating the comment associated with a modified function. We perform a sensitivity analysis to determine the most important attributes. Our analysis shows that the percentage of changed call dependencies and control statements, the age of the modified function, and the number of co-changed functions that depend on it are the most important attributes in determining the likelihood of updating comments.
Keywords :
SQL; database management systems; pattern clustering; public domain software; random processes; FreeBSD; GCluster; PostgreSQL; compilers; database management system; function comment; open source projects; random forests algorithm; sensitivity analysis; software application; Accuracy; Application software; Clustering algorithms; Costs; Database systems; History; Programming profession; Sensitivity analysis; Software engineering; Writing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Software Maintenance, 2008. ICSM 2008. IEEE International Conference on
Conference_Location :
Beijing
ISSN :
1063-6773
Print_ISBN :
978-1-4244-2613-3
Electronic_ISBN :
1063-6773
Type :
conf
DOI :
10.1109/ICSM.2008.4658065
Filename :
4658065