Skip to content
@dedupeio

Dedupe.io

De-duplicate and find matches in your Excel spreadsheet or database

Pinned

  1. dedupe dedupe Public

    🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

    Python 4k 538

  2. csvdedupe csvdedupe Public

    🆔 Command line tool for deduplicating CSV files

    Python 401 83

  3. dedupe-examples dedupe-examples Public

    🆔 Examples for using the dedupe library

    Python 393 216

  4. affinegap affinegap Public

    📐 A Cython implementation of the affine gap string distance

    Cython 58 9

  5. pyhacrf pyhacrf Public

    Forked from dirko/pyhacrf

    📐 Hidden alignment conditional random field for classifying string pairs.

    Python 24 12

  6. doublemetaphone doublemetaphone Public

    🔉 Python wrapper for a C++ Double Metaphone

    C++ 14 7

Repositories

Showing 10 of 31 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…