Title :
High obfuscation plagiarism detection using multi-feature fusion based on Logical Regression model
Author :
Leilei Kong; Zhimao Lu; Haoliang Qi; Zhongyuan Han
Author_Institution :
Harbin Engineering University, Heilongjiang Institute of Technology, China
Abstract :
The identification of high-obfuscation plagiarism seeds is one of the most difficult problems to be solved in plagiarism detection. Single feature type cannot identify the plagiarism seeds effectively because of the varied plagiarism methods used in high-obfuscation plagiarism. In this paper, a multi-features fusion method based on Logical Regression model for the high-obfuscation plagiarism seeds identification was proposed. This method used Logical Regression model to combine lexicon features, syntax features, semantics features and structure features extracted from suspicious text fragments pairs. Experiments show that the method is feasible and effective.
Keywords :
"Plagiarism","Feature extraction","Semantics","Syntactics","Fingerprint recognition","Learning systems","Training data"
Conference_Titel :
Computer Science and Network Technology (ICCSNT), 2015 4th International Conference on
DOI :
10.1109/ICCSNT.2015.7490768