Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Readme should explain meaning of scores #45

Open
soliverc opened this issue Aug 28, 2019 · 4 comments
Open

Readme should explain meaning of scores #45

soliverc opened this issue Aug 28, 2019 · 4 comments

Comments

@soliverc
Copy link

soliverc commented Aug 28, 2019

On what scale are the matches scored?

I noticed with fuzzymatcher.fuzzy_left_join my best_match_scoreranges from -0.7 to + 1.15.

What is the highest possible score in this case? Can it go higher than 1.15?

Usually for fuzzy matching I would have a cutoff of around 0.8 or 0.9., which is on a scale of 0 to 1.

@soliverc
Copy link
Author

I ran the package again today and scores range from -1.4 to +2.5. I can't figure it out!

@soliverc soliverc changed the title Question on scoring. Readme should explain meaning of scores Sep 2, 2019
@ghost
Copy link

ghost commented Jan 15, 2020

I agree, not sure how to read the scores.

@Kreisash
Copy link

Kreisash commented Aug 6, 2020

Just to reiterate that it would be great to get an idea of what the scores mean so that a comparison could be made between various matching algorithms/libraries. I find this library vastly quicker for large data sets so it's shame that this is one of the main drawbacks.

@bobcolner
Copy link

how can we change the scorer to return the true probability of a match??

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants