The Wayback Machine - https://web.archive.org/web/20200918081943/https://github.com/dedupeio
Skip to content
@dedupeio

Dedupe.io

De-duplicate and find matches in your Excel spreadsheet or database

Pinned repositories

  1. πŸ†” A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

    Python 2.7k 403

  2. πŸ†” Command line tool for deduplicating CSV files

    Python 295 70

  3. πŸ†” Examples for using the dedupe library

    Python 248 165

  4. πŸ“ A Cython implementation of the affine gap string distance

    Python 42 4

  5. Forked from dirko/pyhacrf

    πŸ“ Hidden alignment conditional random field for classifying string pairs.

    Python 14 6

  6. πŸ”‰ Python wrapper for a C++ Double Metaphone

    C++ 5 2

Repositories

Top languages

Python C C++

Most used topics

Loading…

People

This organization has no public members. You must be a member to see who’s a part of this organization.

You can’t perform that action at this time.