Title :
Error mining: Bug detection through comparison with large code databases
Author :
Breckel, Alexander
Author_Institution :
Inst. of Software Eng. & Compiler Constr., Univ. of Ulm, Ulm, Germany
Abstract :
Bugs are hard to find. Static analysis tools are capable of systematically detecting predefined sets of errors, but extending them to find new error types requires a deep understanding of the underlying programming language. Manual reviews on the other hand, while being able to reveal more individual errors, require much more time. We present a new approach to automatically detect bugs through comparison with a large code database. The source file is analyzed for similar but slightly different code fragments in the database. Frequent occurrences of common differences indicate a potential bug that can be fixed by applying the modification back to the original source file. In this paper, we give an overview of the resulting algorithm and some important implementation details. We further evaluate the circumstances under which good detection rates can be achieved. The results demonstrate that consistently high detection rates of up to 50% are possible for certain error types across different programming languages.
Keywords :
data mining; database management systems; program debugging; programming languages; bug detection; error mining; error types; individual errors; large code databases; original source file; programming language; static analysis tools; Cloning; Computer bugs; Computer languages; Context; Databases; Software engineering; Syntactics; code databases; code similarity; comparison-based bug detection; static analysis;
Conference_Titel :
Mining Software Repositories (MSR), 2012 9th IEEE Working Conference on
Conference_Location :
Zurich
Print_ISBN :
978-1-4673-1760-3
DOI :
10.1109/MSR.2012.6224278