• DocumentCode
    650739
  • Title

    On the Relationship between the Vocabulary of Bug Reports and Source Code

  • Author

    Moreno, L.; Bandara, Wathsala; Haiduc, Sonia; Marcus, Andrian

  • Author_Institution
    Dept. of Comput. Sci., Wayne State Univ., Detroit, MI, USA
  • fYear
    2013
  • fDate
    22-28 Sept. 2013
  • Firstpage
    452
  • Lastpage
    455
  • Abstract
    Text retrieval (TR) techniques have been widely used to support concept and bug location. When locating bugs, developers often formulate queries based on the bug descriptions. Moreover, a large body of research uses bug descriptions to evaluate TR-based bug location techniques. The implicit assumption is that the bug descriptions and the relevant source code files share important words. In this paper, we present an empirical study that explores this conjecture. We found that bug reports share more terms with the patched classes than with the other classes in the system. Furthermore, we found that class names are more likely to share terms with the bug descriptions than other code locations, while more verbose parts of the code (e.g., comments) share more words. We also found that the shared terms may be better predictors for bug location than some TR techniques.
  • Keywords
    information retrieval; program debugging; text analysis; TR techniques; bug descriptions; bug location techniques; bug reports; source code; text retrieval techniques; Art; Computer bugs; Data collection; Large scale integration; Software systems; Vocabulary; Bug location; source code vocabulary; text retrieval
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Software Maintenance (ICSM), 2013 29th IEEE International Conference on
  • Conference_Location
    Eindhoven
  • ISSN
    1063-6773
  • Type

    conf

  • DOI
    10.1109/ICSM.2013.70
  • Filename
    6676930