DocumentCode :
3479705
Title :
Evolution Analysis of Homogenous Source Code and its Application to Plagiarism Detection
Author :
Ji, Jeong-Hoon ; Park, Su-Hyun ; Woo, Gyun ; Cho, Hwan-Gue
Author_Institution :
Dept. of Comput. Eng., Pusan Nat. Univ., Pusan
fYear :
2007
fDate :
11-13 Oct. 2007
Firstpage :
813
Lastpage :
818
Abstract :
Due to intelligent softwares and worldwide internet environment, unauthorized source code theft/copying and partial plagiarism is widespread. So the detecting the plagiarized source codes and software is getting important, especially in academic field. Though there have been announced lots of studies for detecting plagiarized pair of codes, we did not find somewhat more fundamental work for understanding the mechanism of plagiarism. So in order to improve the plagiarism detecting algorithm, it is desirable to reveal the whole procedure of code plagiarism. The basic idea of this paper is that software plagiarism can be considered as an evolution process of a source code. The final goal of our paper is to reconstruct the phylogenetic tree with a set of plagiarized codes. The main contribution of this paper is to propose an asymmetric code similarity measure, which enables us to guess the direction of plagiarism. For this purpose, we applied local alignment approach to detect a region of similar codes with a novel adaptive matching score matrix. To show the effectiveness and efficiency of our procedure, we conducted experiments on 20 artificially generated phylogenetic trees whose roots are selected from four program groups of ICPC (International Collegiate Programming Contest). Experiment showed that the proposed algorithm can effectively estimate the evolution direction, which enables us to identify plagiarized codes more accurately and reliably.
Keywords :
computer science education; copy protection; copyright; programming; homogenous source code; plagiarism detecting algorithm; plagiarism detection; source code copying; source code theft; Application software; Biology computing; Computer viruses; Dynamic programming; Evolution (biology); Information analysis; Information technology; Internet; Phylogeny; Plagiarism;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Frontiers in the Convergence of Bioscience and Information Technologies, 2007. FBIT 2007
Conference_Location :
Jeju City
Print_ISBN :
978-0-7695-2999-8
Type :
conf
DOI :
10.1109/FBIT.2007.125
Filename :
4524212
Link To Document :
بازگشت