DocumentCode
559863
Title
Analysis on Identification Technologies of Program Code Similarity
Author
Chen, Gang ; Zhang, Yuqing ; Wang, Xin
Author_Institution
Comput. Sci. & Technol., China Univ. of Geosci. (Beijing), Beijing, China
Volume
1
fYear
2011
fDate
24-25 Sept. 2011
Firstpage
188
Lastpage
191
Abstract
Technologies for identifying program code similarity are important in college practice courses and in software design. This paper introduces the definition of code similarity, summarizes eight means of plagiarizing program code, and presents the Pyramid of Code Plagiarism. It also compares identification technologies and systems proposed by earlier researchers, and analyses identification technologies from the past five years that are based on Abstract Syntax Tree, Program Dependence Graph, BP Neural Network and XML. The paper further points out the field's lack of identification accuracy, test sets and an official definition of similarity. It also looks at trends in this technology, such as combining different techniques and applying new algorithms.
Keywords
XML; backpropagation; computer science education; educational courses; educational technology; neural nets; software engineering; BP neural network; XML; abstract syntax tree; college practice courses; identification technologies; plagiarism; program code similarity; program dependence graph; software design; Algorithm design and analysis; Biological neural networks; Computers; Plagiarism; Semantics; Syntactics; XML; identification technology; program code; similarity;
fLanguage
English
Publisher
IEEE
Conference_Titel
Information Technology, Computer Engineering and Management Sciences (ICM), 2011 International Conference on
Conference_Location
Nanjing, Jiangsu
Print_ISBN
978-1-4577-1419-1
Type
conf
DOI
10.1109/ICM.2011.240
Filename
6113388