• DocumentCode
    257476
  • Title

    Machine learning tool and meta-heuristic based on genetic algorithms for plagiarism detection over mail service

  • Author

    Bouarara, Hadj Ahmed ; Rahmani, Amine ; Hamou, Reda Mohamed ; Amine, Achkar

  • Author_Institution
    Dept. of Comput. Sci., Tahar Moulay Univ. of Saida Algeria, Saida, Algeria
  • fYear
    2014
  • fDate
    4-6 June 2014
  • Firstpage
    157
  • Lastpage
    162
  • Abstract
    One of the most modern problems that computer science try to resolve is the plagiarism, in this article we present a new approach for automatic plagiarism detection in world of mail service. Our system is based on the n-gram character for the representation of the texts and tfidf as weighting to calculate the importance of term in the corpus, we use also a combination between the machine learning methods as a way to detect if a document is plagiarized or not, we use pan 09 corpus for the construction and evaluation of the prediction model then we simulate a meta-heuristic method based on genetic algorithms with a variations of parameters to know if it can improve the results. The main objective of our work is to protect intellectual property and improve the efficiency of plagiarism detection system.
  • Keywords
    copyright; electronic mail; genetic algorithms; learning (artificial intelligence); text analysis; Email Service; automatic plagiarism detection; genetic algorithms; machine learning tool; meta-heuristic; plagiarized document detection; text representation; Classification algorithms; Detectors; Electronic mail; Entropy; Genetic algorithms; Plagiarism; Servers; Email Service; Machine Learning; Meta-heuristics; Plagiarism Detective;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Science (ICIS), 2014 IEEE/ACIS 13th International Conference on
  • Conference_Location
    Taiyuan
  • Type

    conf

  • DOI
    10.1109/ICIS.2014.6912125
  • Filename
    6912125