Title of article
External Plagiarism Detection based on Human Behaviors in Producing Paraphrases of Sentences in English and Persian Languages
Author/Authors
Shojaei, A Faculty of Computer Engineering - Najafabad Branch - Islamic Azad University - Najafabad, Iran , Safi-Esfahani, F Big Data Research Center - Najafabad Branch - Islamic Azad University - Najafabad, Iran
Pages
16
From page
451
To page
466
Abstract
With the advent of the internet and easy access to digital libraries, plagiarism has become a major issue. Applying search engines is one of the plagiarism detection techniques that converts plagiarism patterns to search queries. Generating suitable queries is the heart of this technique, and the existing methods suffer from the lack of producing accurate queries, Precision, and Speed of retrieved results. This research work proposes a framework called ParaMaker. It generates accurate paraphrases of any sentence, similar to human behaviors, and sends them to a search engine to find the plagiarism patterns. For the English language, ParaMaker is examined against six known methods with standard PAN2014 datasets. The results obtained show an improvement of 34% in terms of the Recall parameter, while the parameters Precision and Speed are maintained. In the Persian language, statements of suspicious documents are examined compared to an exact search approach. ParaMaker shows an improvement of at least 42% in Recall, while Precision and Speed are maintained.
Keywords
Sentence Paraphrase Producing , Resource Retrieval , External Plagiarism Detection , Plagiarism Detection
Journal title
Astroparticle Physics
Serial Year
2019
Record number
2453046
Link To Document