DocumentCode
650739
Title
On the Relationship between the Vocabulary of Bug Reports and Source Code
Author
Moreno, L. ; Bandara, Wathsala ; Haiduc, Sonia ; Marcus, Andrian
Author_Institution
Dept. of Comput. Sci., Wayne State Univ., Detroit, MI, USA
fYear
2013
fDate
22-28 Sept. 2013
Firstpage
452
Lastpage
455
Abstract
Text retrieval (TR) techniques have been widely used to support concept and bug location. When locating bugs, developers often formulate queries based on the bug descriptions. More than that, a large body of research uses bug descriptions to evaluate bug location techniques using TR. The implicit assumption is that the bug descriptions and the relevant source code files share important words. In this paper, we present an empirical study that explores this conjecture. We found that bug reports share more terms with the patched classes than with the other classes in the system. Furthermore, we found that the class names are more likely to share terms with the bug descriptions than other code locations, while more verbose parts of the code (e.g., comments) will share more words. We also found that the shared terms may be better predictors for bug location than some TR techniques.
Keywords
information retrieval; program debugging; text analysis; TR techniques; bug descriptions; bug location techniques; bug reports; source code; text retrieval techniques; Art; Computer bugs; Data collection; Large scale integration; Software systems; Vocabulary; Bug location; source code vocabulary; text retrieval;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Maintenance (ICSM), 2013 29th IEEE International Conference on
Conference_Location
Eindhoven
ISSN
1063-6773
Type
conf
DOI
10.1109/ICSM.2013.70
Filename
6676930
Link To Document