You are viewing a single comment's thread from:

RE: Text Similarity Analysis with C++: Detecting Duplicate Content

in #techclub8 days ago

Bro you working on a Great project, Your plagiarism detector is look well-structured and easy to follow.

I suggest you to consider adding more features like ignoring common phrases such as (The starting of an article like How are you, I hope you well, bla bla bla...) or sentences, and supporting multiple file formats.

Also, you could improve the accuracy by using more advanced algorithms like Levenshtein distance or Longest Common Subsequence. Well wish you all the best for your project.

Sort:  

Thank you so much for the valuable feedback.

This project already ignore the comma's phrases and also eliminate the common words like the, and, or, that, this etc. After eliminating these words this project check the specific words that have a specific meaning.

Also I will do my best to improve this project by adding more functionalities.

Thank you I wish success too.

That's sounds good, keep up the good work bro ...