DocumentCode :
3237082
Title :
Enhancing Source-Based Clone Detection Using Intermediate Representation
Author :
Selim, Gehan M K ; Foo, King Chun ; Zou, Ying
Author_Institution :
Sch. of Comput., Queen's Univ., Kingston, ON, Canada
fYear :
2010
fDate :
13-16 Oct. 2010
Firstpage :
227
Lastpage :
236
Abstract :
Detecting software clones in large-scale projects helps improve the maintainability of large code bases. The source code representation (e.g., Java or C files) of a software system has traditionally been used for clone detection. In this paper, we propose a technique that transforms the source code to an intermediate representation and then reuses established source-based clone detection techniques to detect clones in the intermediate representation. The clones are mapped back to the source code and are used to augment the results reported by source-based clone detection. We demonstrate the performance of our new technique using systems from the Bellon clone evaluation benchmark. The results show that our technique can detect Type 3 clones. On the Bellon corpus, our technique achieves higher recall with a minimal drop in precision. When complete clone groups are examined, our technique has higher precision than the standalone string-based and token-based clone detectors.
Keywords :
program diagnostics; software maintenance; Bellon clone evaluation benchmark; Bellon corpus; intermediate representation; large code base; software clone detection; source-based clone detection; source code representation; Benchmark testing; Binary codes; Cloning; Detectors; Encoding; Java; Merging; software clones; string-based clone detection tools; token-based clone detection tools
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Reverse Engineering (WCRE), 2010 17th Working Conference on
Conference_Location :
Beverly, MA
ISSN :
1095-1350
Print_ISBN :
978-1-4244-8911-4
Type :
conf
DOI :
10.1109/WCRE.2010.33
Filename :
5645563