• DocumentCode
    559863
  • Title

    Analysis on Identification Technologies of Program Code Similarity

  • Author

    Chen, Gang ; Zhang, Yuqing ; Wang, Xin

  • Author_Institution
    Comput. Sci. & Technol., China Univ. of Geosci. (Beijing), Beijing, China
  • Volume
    1
  • fYear
    2011
  • fDate
    24-25 Sept. 2011
  • Firstpage
    188
  • Lastpage
    191
  • Abstract
    The technologies for identifying program code similarity are significant in college practice courses and software design. This paper introduces the definition of code similarity, summarizes eight means of plagiarism of program code, and presents the Pyramid of Code Plagiarism. It also compares some identification technologies and systems suggested by previous scholars, and analyses identification technologies in the past five years which are based on Abstract Syntax Tree, Program Dependence Graph, BP Neural Network and XML. The paper further points out the lacks of identification accuracy, test sets and an official definition of similarity in this field. It also looks into the trend of this technology, such as a combination of different techniques and application of new algorithms.
  • Keywords
    XML; backpropagation; computer science education; educational courses; educational technology; neural nets; software engineering; BP nural ntwork; XML; abstract syntax tree; college practice courses; identification technologies; plagiarism; program code similarity; program dependence graph; software design; Algorithm design and analysis; Biological neural networks; Computers; Plagiarism; Semantics; Syntactics; XML; identification technology; program code; similarity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology, Computer Engineering and Management Sciences (ICM), 2011 International Conference on
  • Conference_Location
    Nanjing, Jiangsu
  • Print_ISBN
    978-1-4577-1419-1
  • Type

    conf

  • DOI
    10.1109/ICM.2011.240
  • Filename
    6113388