Abstract :
Internet plagiarism and paper industrypsilas development lead to the extension of plagiarism phenomenon. However, Chinese anti-plagiarism is still in the initial stages of development.Moreover, plagiarism activities involve directly copying and semantic plagiarizing, so the definition of plagiarism cannot be unified. From 2006 to 2008, based on two patents, we implement an anti-plagiarism system named ROST AntiP which covers 18.8 billion Web pages and 4.9 million literatures, presents flexible match technology based on attribute value strings, can flexibly define plagiarism rule, and can implement fuzzy detection. This system has been practically using in several editorial office and universities. According to the practical collecting data, we find the PlagTrendHot phenomenon and the plagiarism-first-page phenomenon, thereby we improve the macroscopical algorithm to solve detecting speed problem. Besides, we accurately estimate the context bound errors involved in fuzzy matching, and preliminarily achieve the practice implementation goal.
Keywords :
Internet; copy protection; law; Chinese antiplagiarism; Internet plagiarism; PlagTrendHot phenomenon; ROST AntiP; Web pages; antiplagiarism system; context bound errors; fuzzy detection; fuzzy matching; paper industry development; plagiarism law; plagiarism-first-page phenomenon; semantic plagiarizing; Computer science education; Educational institutions; Information processing; Natural languages; Paper technology; Plagiarism; Prototypes; Pulp and paper industry; Software libraries; Web pages; academics morality; anti-plagiarism; electronic study; plagiarism; the law of plagiarism;