An Empirical Research Study on the Effectiveness of Plagiarism Detection Algorithms

This project aims to develop a platform for detecting plagiarism on English, French and Arabic texts using Artificial Intelligence and Natural Language Processing algorithms. The project takes two or more reports as input and uses string processing algorithms and NLP technologies to produce a result indicating the level of plagiarism in the report.

Algorithms and Methods Used

The project uses various algorithms and methods for detecting plagiarism, including:

Token Count Vectorizer
Term Frequency-Inverse Document Frequency
Similar_Text Algorithm
Levenshtein Distance Algorithm
Jaccard Index Algorithm
Cosine Similarity Algorithm
Longest Common Subsequence Algorithm
Dice Coefficient Algorithm

The project also uses various preprocessing techniques, including normalization, Arabic tashkil removal, stop word removal, lemmatization, stemming, and tokenization into N-grams.

Tools and Dependencies

The project is written using PHP, JS, Bootstrap, HTML5, CSS3, SQL.

Requirements

Lemmatizer : https://github.com/writecrow/lemmatizer
PHP-ML : https://php-ml.readthedocs.io/en/latest
PHP-LCS : https://packagist.org/packages/eloquent/lcs
NLP-Tools : http://php-nlp-tools.com/documentation/
php-stemmer : https://github.com/amaccis/php-stemmer

Project Structure

This repository contains the report and source code for the project, along with the database file. The report is organized into five chapters, covering introduction and context, preprocessing, similarity calculation algorithms, user interface design, implementation, experimentation, and discussion.

Conclusion

This project demonstrates the potential of using Artificial Intelligence and Natural Language Processing algorithms for detecting plagiarism in documents. The different algorithms and methods used in the project provide a comprehensive approach for identifying plagiarized documents with high accuracy.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
src		src
Project Report.pdf		Project Report.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

An Empirical Research Study on the Effectiveness of Plagiarism Detection Algorithms

Algorithms and Methods Used

Tools and Dependencies

Requirements

Project Structure

Conclusion

About

Releases

Packages

Languages

p1x33l/An-Empirical-Research-Study-on-the-Effectiveness-of-Plagiarism-Detection-Algorithms

Folders and files

Latest commit

History

Repository files navigation

An Empirical Research Study on the Effectiveness of Plagiarism Detection Algorithms

Algorithms and Methods Used

Tools and Dependencies

Requirements

Project Structure

Conclusion

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages