DocumentCode :
2984917
Title :
Software Fingerprinting
Author :
Gil, Joseph ; Gorovoy, Alexander ; Itai, Alon
Author_Institution :
Dept. of Comput. Sci., Technion - Israel Inst. of Technol., Haifa
fYear :
2006
fDate :
16-19 Oct. 2006
Firstpage :
69
Lastpage :
73
Abstract :
A new cross-compression metric, based on Lempel-Ziv-Welch compression, is suggested as a method for detecting programming style. In essence, the process of compressing one program is used to build a string table, which is then used to compress the text of another program. The resulting compression ratio defines a measure of coherency between the two programs. Using the metric, we define the stylistic separation condition and show how it can be used to automatically assign a code fragment to the program to which it belongs. This condition is demonstrated to hold in an ensemble of 21 large programs, spanning over a million lines of code. We also develop a new linear-time compression algorithm which compresses programs based on their syntactical structure. This algorithm is insensitive to superficial similarities, such as in identifier names, and is competitive with standard compressors. Beyond its application to achieving better compression rates for program source, we use this new algorithm to measure similarity between programs. The stylistic separation condition holds for this algorithm as well.
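The cross-compression idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact ratio formula or say whether the string table keeps growing while the second program is compressed. The sketch assumes a frozen table and measures the ratio as emitted codes per input character (lower means more similar). The names `build_table`, `count_codes`, `cross_compression_ratio`, and `assign` are hypothetical.

```python
from __future__ import annotations


def build_table(text: str) -> set[str]:
    """Run an LZW pass over `text` and return the resulting string table."""
    # Seed with all single-byte characters plus any other character in the text.
    table = {chr(i) for i in range(256)}
    table.update(text)
    w = ""
    for ch in text:
        if w + ch in table:
            w += ch              # keep extending the current phrase
        else:
            table.add(w + ch)    # learn a new phrase, as in standard LZW
            w = ch
    return table


def count_codes(text: str, table: set[str]) -> int:
    """Parse `text` greedily against a frozen table; return the codes emitted."""
    # Assumption: the table is NOT updated here. The LZW table is prefix-closed,
    # so extending one character at a time finds the longest matching phrase.
    codes = 0
    w = ""
    for ch in text:
        if w + ch in table:
            w += ch
        else:
            if w:
                codes += 1
            w = ch               # unknown characters end up as one code each
    if w:
        codes += 1
    return codes


def cross_compression_ratio(source: str, target: str) -> float:
    """Codes per character when `target` is parsed with the table of `source`."""
    if not target:
        raise ValueError("target must be non-empty")
    return count_codes(target, build_table(source)) / len(target)


def assign(fragment: str, programs: dict[str, str]) -> str:
    """Return the name of the program whose table compresses `fragment` best."""
    return min(programs, key=lambda name: cross_compression_ratio(programs[name], fragment))
```

Under these assumptions, `assign(fragment, {"a": text_a, "b": text_b})` picks the program whose string table yields the fewest codes for the fragment, which mirrors the fragment-assignment use described in the abstract.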
Keywords :
data compression; program compilers; software engineering; string matching; Lempel-Ziv-Welch compression; code fragment; cross-compression metric; linear-time compression algorithm; program compression; software fingerprinting; string table; stylistic separation condition; syntactical structure; Algorithm design and analysis; Compression algorithms; Compressors; Computer languages; Computer science; Dictionaries; Fingerprint recognition; Information technology; Plagiarism; Programming profession;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information Technology: Research and Education, 2006. ITRE '06. International Conference on
Conference_Location :
Tel-Aviv
Print_ISBN :
1-4244-0858-X
Electronic_ISBN :
1-4244-0859-8
Type :
conf
DOI :
10.1109/ITRE.2006.381536
Filename :
4266297