DocumentCode :
3777281
Title :
High obfuscation plagiarism detection using multi-feature fusion based on Logical Regression model
Author :
Leilei Kong; Zhimao Lu; Haoliang Qi; Zhongyuan Han
Author_Institution :
Harbin Engineering University, Heilongjiang Institute of Technology, China
Volume :
1
fYear :
2015
Firstpage :
355
Lastpage :
359
Abstract :
The identification of high-obfuscation plagiarism seeds is one of the most difficult problems to be solved in plagiarism detection. Single feature type cannot identify the plagiarism seeds effectively because of the varied plagiarism methods used in high-obfuscation plagiarism. In this paper, a multi-features fusion method based on Logical Regression model for the high-obfuscation plagiarism seeds identification was proposed. This method used Logical Regression model to combine lexicon features, syntax features, semantics features and structure features extracted from suspicious text fragments pairs. Experiments show that the method is feasible and effective.
Keywords :
"Plagiarism","Feature extraction","Semantics","Syntactics","Fingerprint recognition","Learning systems","Training data"
Publisher :
ieee
Conference_Titel :
Computer Science and Network Technology (ICCSNT), 2015 4th International Conference on
Type :
conf
DOI :
10.1109/ICCSNT.2015.7490768
Filename :
7490768
Link To Document :
بازگشت