Title :
Version history based source code plagiarism detection in proprietary systems
Author :
Maskeri, G. ; Karnam, D. ; Viswanathan, S.A. ; Padmanabhuni, Srinivas
Author_Institution :
Infosys Labs., Infosys Ltd., Bangalore, India
Abstract :
While the advent of open source code search tools has made the source code of thousands of open source software (OSS) projects readily accessible, thereby increasing legitimate reuse, it has also opened up the possibility of unconscientious employees plagiarizing code from OSS repositories. Plagiarism in proprietary software would not only lead to costly lawsuits but also undermine the credibility of the organization. Hence, detecting plagiarism in proprietary software is an urgent need. Though a number of techniques exist for detecting plagiarism in student project assignments, they do not scale well to large proprietary software, especially when code snippets are plagiarized from the large number of available open source projects. In this paper, we propose a novel approach that applies Mining Software Repositories (MSR) based techniques to the problem of plagiarism detection. We create a programming style profile for each maintenance engineer by mining the version history and use it to detect source code commits that are likely to be plagiarized. Such suspected code fragments can then be analyzed using any of the existing plagiarism detection techniques to confirm the plagiarism and ascertain the original code.
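The abstract does not say which style features or which decision rule the authors use, so the following is only a minimal sketch of the general idea, under stated assumptions. It builds a per-engineer profile from past commits and flags a new commit whose style deviates strongly from it. The four features, the z-score rule, the threshold of 3.0, and all function names (`style_features`, `build_profile`, `deviation`, `is_suspicious`) are hypothetical choices made for illustration, not the paper's method.

```python
import re
from statistics import mean, pstdev


def style_features(code):
    """Return a small, hypothetical style vector for one code fragment."""
    lines = [ln for ln in code.splitlines() if ln.strip()]
    if not lines:
        return [0.0, 0.0, 0.0, 0.5]
    identifiers = re.findall(r"[A-Za-z_][A-Za-z0-9_]*", code)
    snake = sum(1 for name in identifiers if "_" in name.strip("_"))
    camel = sum(1 for name in identifiers if re.search(r"[a-z][A-Z]", name))
    comment_marks = ("//", "#")
    return [
        # Average length of non-blank lines.
        mean(len(ln) for ln in lines),
        # Share of non-blank lines indented with a tab.
        sum(ln.startswith("\t") for ln in lines) / len(lines),
        # Share of non-blank lines that are line comments.
        sum(ln.lstrip().startswith(comment_marks) for ln in lines) / len(lines),
        # Preference for camelCase over snake_case (0.5 if neither occurs).
        camel / (camel + snake) if camel + snake else 0.5,
    ]


def build_profile(past_commits):
    """Summarize an engineer's history as (mean, std) per feature."""
    rows = [style_features(code) for code in past_commits]
    return [(mean(col), pstdev(col)) for col in zip(*rows)]


def deviation(profile, code, min_std=0.05):
    """Largest per-feature z-score of a commit against the profile.

    min_std is an assumed floor that avoids division by zero when a
    feature never varied in the engineer's history.
    """
    features = style_features(code)
    return max(
        abs(value - mu) / max(sd, min_std)
        for value, (mu, sd) in zip(features, profile)
    )


def is_suspicious(profile, code, threshold=3.0):
    """Flag a commit for follow-up by a conventional plagiarism detector."""
    return deviation(profile, code) > threshold
```

In this sketch a flagged commit is only a candidate: as the abstract notes, confirmation and identification of the original code are left to an existing plagiarism detection technique.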
Keywords :
data mining; organisational aspects; program diagnostics; public domain software; software maintenance; software reusability; source coding; MSR based techniques; OSS repositories; code snippets; legitimate reuse; mining software repositories based techniques; open source code search tools; open source software; organization credibility; programming style profile; proprietary systems; student project assignments; version history based source code plagiarism detection; Cloning; Conferences; Educational institutions; History; Plagiarism; Programming; Software; Author Information; Plagiarism; Version History;
Conference_Title :
Software Maintenance (ICSM), 2012 28th IEEE International Conference on
Conference_Location :
Trento
Print_ISBN :
978-1-4673-2313-0
DOI :
10.1109/ICSM.2012.6405334