DocumentCode :
650739
Title :
On the Relationship between the Vocabulary of Bug Reports and Source Code
Author :
Moreno, L. ; Bandara, Wathsala ; Haiduc, Sonia ; Marcus, Andrian
Author_Institution :
Dept. of Comput. Sci., Wayne State Univ., Detroit, MI, USA
fYear :
2013
fDate :
22-28 Sept. 2013
Firstpage :
452
Lastpage :
455
Abstract :
Text retrieval (TR) techniques have been widely used to support concept and bug location. When locating bugs, developers often formulate queries based on the bug descriptions. Moreover, a large body of research uses bug descriptions to evaluate TR-based bug location techniques. The implicit assumption is that the bug descriptions and the relevant source code files share important words. In this paper, we present an empirical study that explores this conjecture. We found that bug reports share more terms with the patched classes than with the other classes in the system. Furthermore, we found that the class names are more likely to share terms with the bug descriptions than other code locations, while more verbose parts of the code (e.g., comments) will share more words. We also found that the shared terms may be better predictors for bug location than some TR techniques.
Keywords :
information retrieval; program debugging; text analysis; TR techniques; bug descriptions; bug location techniques; bug reports; source code; text retrieval techniques; Art; Computer bugs; Data collection; Large scale integration; Software systems; Vocabulary; Bug location; source code vocabulary; text retrieval;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Software Maintenance (ICSM), 2013 29th IEEE International Conference on
Conference_Location :
Eindhoven
ISSN :
1063-6773
Type :
conf
DOI :
10.1109/ICSM.2013.70
Filename :
6676930