Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Look into SIPs for better plagiarism detection #184

Open
Bhargav-Rao opened this issue Feb 6, 2019 · 0 comments
Open

Look into SIPs for better plagiarism detection #184

Bhargav-Rao opened this issue Feb 6, 2019 · 0 comments

Comments

@Bhargav-Rao
Copy link
Member

From Cody Gray:

I still feel like there's a way to do it with Statistically Improbable Phrase matching, which is exactly what I do manually. But maybe it's naïve to think that's straightforward to implement. For this application, it isn't necessary to match the whole post, or even score the whole post. Just flag it if you find a SIP in the post that occurs elsewhere. If you find several, the score could be increased.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant