DocumentCode :
688202
Title :
DKISB: Dynamic Key Instruction Sequence Birthmark for Software Plagiarism Detection
Author :
Zhenzhou Tian ; Qinghua Zheng ; Ting Liu ; Ming Fan
Author_Institution :
Dept. of Comput. Sci. & Technol., Xi´an Jiaotong Univ., Xi´an, China
fYear :
2013
fDate :
13-15 Nov. 2013
Firstpage :
619
Lastpage :
627
Abstract :
With the burst of open source software, software plagiarism has been a serious threat to the healthy development of software industry. Software birthmark reflecting intrinsic properties of software, is an effective way for the detection of software theft. However, most of the existing software birthmarks face a series of challenges: (1) the absence of source code, (2) diversity of operating systems and programming languages, (3) various automated code obfuscation techniques. In this paper, a dynamic key instruction sequence based software birthmark (DKISB) is proposed. By introducing dynamic data flow analysis into birthmark generation, we are able to produce a high quality birthmark that is closely correlated to program semantics, making it resilient to various kinds of semantic-preserving code obfuscation techniques. Based on the Pin instrumentation framework, a DKISB based software plagiarism detection system is implemented, which generates birthmarks for both the plaintiff and defendant program, and then make the plagiarism decision according to the similarity of their birthmarks. The experimental results show that DKISB is effective to either weak obfuscation techniques like compiler optimization or strong obfuscation techniques provided by tools such as Sand Mark.
Keywords :
program compilers; public domain software; security of data; source code (software); DKISB; Sand Mark; automated code obfuscation techniques; compiler optimization; defendant program; dynamic data flow analysis; dynamic key instruction sequence; dynamic key instruction sequence birthmark; healthy development; intrinsic properties; obfuscation techniques; open source software; operating systems; pin instrumentation framework; plaintiff program; programming languages; serious threat; software birthmark; software industry; software plagiarism detection; source code; Heuristic algorithms; Java; Performance analysis; Plagiarism; Semantics; Software; Watermarking; dynamic key instruction sequence; similarity comparison; software birthmark; software plagiarism;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
Conference_Location :
Zhangjiajie
Type :
conf
DOI :
10.1109/HPCC.and.EUC.2013.93
Filename :
6831975
Link To Document :
بازگشت