Improved matching performance for big datasets

@RobinL RobinL released this 21 Dec 12:46
· 32 commits to master since this release

Changes in this release:

  • Probabilistic matching now works on a field-by-field basis rather than lumping the tokens from all fields together
  • The search algorithm now takes into account the scores of the matches found so far, stopping once a good match is detected
  • Improved scoring algorithm that accounts for non-matching tokens and misspellings
  • Memoisation to improve performance
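
The ideas in the list above can be illustrated with a minimal sketch. This is not the code from this release: every function name, the `good_enough` threshold, and the use of `difflib.SequenceMatcher` as a stand-in for the probabilistic score are assumptions made purely for illustration.

```python
from difflib import SequenceMatcher
from functools import lru_cache


@lru_cache(maxsize=None)
def token_similarity(a: str, b: str) -> float:
    """Similarity in [0, 1] that tolerates misspellings.

    Memoised: the same token pairs recur often in big datasets.
    (Stand-in for a real probabilistic score; an assumption.)
    """
    return SequenceMatcher(None, a, b).ratio()


def field_score(left: str, right: str) -> float:
    """Score a single field; tokens without a close partner lower the score."""
    left_tokens = left.lower().split()
    right_tokens = right.lower().split()
    if not left_tokens or not right_tokens:
        return 0.0
    best = [max(token_similarity(l, r) for r in right_tokens) for l in left_tokens]
    return sum(best) / len(left_tokens)


def record_score(left: dict, right: dict) -> float:
    """Compare field by field instead of pooling all tokens together."""
    fields = left.keys() & right.keys()
    if not fields:
        return 0.0
    return sum(field_score(left[f], right[f]) for f in fields) / len(fields)


def find_match(record: dict, candidates: list, good_enough: float = 0.9):
    """Return (best candidate, score), stopping once a good match is seen."""
    best, best_score = None, 0.0
    for candidate in candidates:
        score = record_score(record, candidate)
        if score > best_score:
            best, best_score = candidate, score
        if best_score >= good_enough:
            break  # hypothetical threshold: a good match ends the search early
    return best, best_score


candidates = [
    {"name": "jane doe", "city": "paris"},
    {"name": "john smith", "city": "london"},
    {"name": "john smyth", "city": "london"},
]
match, score = find_match({"name": "jon smith", "city": "london"}, candidates)
print(match, round(score, 3))
```

In this sketch the third candidate is never scored, because the second already clears the threshold, and repeated token pairs such as `("london", "london")` are served from the cache on later calls.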