Title :
Index-based code clone detection: incremental, distributed, scalable
Author :
Hummel, Benjamin ; Juergens, Elmar ; Heinemann, Lars ; Conradt, Michael
Author_Institution :
Tech. Univ. Munchen, München, Germany
Abstract :
Although numerous different clone detection approaches have been proposed to date, not a single one is both incremental and scalable to very large code bases. They thus cannot provide real-time cloning information for clone management of very large systems. We present a novel, index-based clone detection algorithm for type 1 and 2 clones that is both incremental and scalable. It enables a new generation of clone management tools that provide real-time cloning information for very large software. We report on several case studies that show both its suitability for real-time clone detection and its scalability: on 42 MLOC of Eclipse code, average time to retrieve all clones for a file was below 1 second; on 100 machines, detection of all clones in 73 MLOC was completed in 36 minutes.
Keywords :
software maintenance; 42 MLOC; Eclipse code; clone management tool; index-based code clone detection; Cloning; Detection algorithms; Indexes; Maintenance engineering; Real time systems; Scalability; Software;
Conference_Titel :
Software Maintenance (ICSM), 2010 IEEE International Conference on
Conference_Location :
Timisoara
Print_ISBN :
978-1-4244-8630-4
Electronic_ISBN :
1063-6773
DOI :
10.1109/ICSM.2010.5609665