• DocumentCode
    700371
  • Title

    Detecting duplicate bug reports with software engineering domain knowledge

  • Author

    Aggarwal, Karan ; Rutgers, Tanner ; Timbers, Finbarr ; Hindle, Abram ; Greiner, Russ ; Stroulia, Eleni

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Alberta, Edmonton, AB, Canada
  • fYear
    2015
  • fDate
    2-6 March 2015
  • Firstpage
    211
  • Lastpage
    220
  • Abstract
    In previous work by Alipour et al., a methodology was proposed for detecting duplicate bug reports by comparing the textual content of bug reports to subject-specific contextual material, namely lists of software-engineering terms, such as non-functional requirements and architecture keywords. When a bug report contains a word in these word-list contexts, the bug report is considered to be associated with that context and this information tends to improve bug-deduplication methods. In this paper, we propose a method to partially automate the extraction of contextual word lists from software-engineering literature. Evaluating this software-literature context method on real-world bug reports produces useful results that indicate this semi-automated method has the potential to substantially decrease the manual effort used in contextual bug deduplication while suffering only a minor loss in accuracy.
  • Keywords
    program debugging; software engineering; contextual bug deduplication; contextual word list extraction; duplicate bug report detection; semiautomated method; software engineering domain knowledge; Accuracy; Androids; Computer bugs; Context; Documentation; Feature extraction; Humanoid robots; documentation; duplicate bug reports; information retrieval; machine learning; software engineering textbooks; software literature;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Analysis, Evolution and Reengineering (SANER), 2015 IEEE 22nd International Conference on
  • Conference_Location
    Montreal, QC
  • Type

    conf

  • DOI
    10.1109/SANER.2015.7081831
  • Filename
    7081831