DocumentCode
3731330
Title
Discovering text reuse in large collections of documents: A study of theses in history sciences
Author
Anton S. Khritankov;Pavel V. Botov;Nikolay S. Surovenko;Sergey V. Tsarkov;Dmitriy V. Viuchnov;Yuri V. Chekhovich
Author_Institution
Anti-Plagiat JSC, Moscow, Russia
fYear
2015
Firstpage
26
Lastpage
32
Abstract
In this paper we investigate graphs of text reuse cases in scientific degree theses in history sciences (07.xx.xx of Russian Higher Attestation Committee topic codes). Using algorithmic and statistical methods we discovered groups of highly connected theses with large amount of text reuse between them. In addition we located works compiled from several other theses and point out sources of reuse.
Keywords
Filtering
Publisher
ieee
Conference_Titel
Artificial Intelligence and Natural Language and Information Extraction, Social Media and Web Search FRUCT Conference (AINL-ISMW FRUCT), 2015
Type
conf
DOI
10.1109/AINL-ISMW-FRUCT.2015.7382965
Filename
7382965
Link To Document