The Wayback Machine - https://web.archive.org/web/20200806230413/https://github.com/dedupeio
Skip to content
@dedupeio

Dedupe.io

De-duplicate and find matches in your Excel spreadsheet or database

Pinned repositories

  1. πŸ†” A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

    Python 2.7k 398

  2. πŸ†” Command line tool for deduplicating CSV files

    Python 291 69

  3. πŸ†” Examples for using the dedupe library

    Python 246 160

  4. πŸ“ A Cython implementation of the affine gap string distance

    Python 41 4

  5. Forked from dirko/pyhacrf

    πŸ“ Hidden alignment conditional random field for classifying string pairs.

    Python 14 6

  6. πŸ”‰ Python wrapper for a C++ Double Metaphone

    C++ 5 2

Repositories

Top languages

Python C C++

Most used topics

Loading…

People

This organization has no public members. You must be a member to see who’s a part of this organization.

You can’t perform that action at this time.