The Wayback Machine - https://web.archive.org/web/20220409093604/https://github.com/dedupeio
@dedupeio

Dedupe.io

De-duplicate and find matches in your Excel spreadsheet or database

Pinned

  1. dedupe Public

    🆔 A Python library for accurate and scalable fuzzy matching, record deduplication, and entity resolution.

    Python 3.3k 473

  2. 🆔 Command-line tool for deduplicating CSV files

    Python 348 79

  3. 🆔 Examples for using the dedupe library

    Python 313 199

  4. A Cython implementation of the affine gap string distance

    Cython 52 7

  5. pyhacrf Public

    Forked from dirko/pyhacrf

    ๐Ÿ“ Hidden alignment conditional random field for classifying string pairs.

    Python 23 9

  6. 🔉 Python wrapper for a C++ Double Metaphone

    C++ 10 6

Repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.
