The Wayback Machine - https://web.archive.org/web/20221203190727/https://github.com/topics/deduplication
Here are
293 public repositories
matching this topic...
Fast, secure, efficient backup program
Deduplicating archiver with compression and authenticated encryption.
Updated
Dec 3, 2022
Python
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
Updated
Nov 24, 2022
Python
Cross-platform backup tool for Windows, macOS & Linux with fast, incremental backups, client-side end-to-end encryption, compression and data deduplication. CLI and GUI included.
Extremely fast tool to remove duplicates and other lint from your filesystem
A powerful duplicate file finder and an enhanced fork of 'fdupes'.
Simple, configuration-driven backup software for servers and workstations
Updated
Dec 2, 2022
Python
A fast high compression read-only file system
Data deduplication engine, supporting optional compression and public key encryption.
Updated
Aug 25, 2022
Rust
A powerful and modular toolkit for record linkage and duplicate detection in Python
Updated
Apr 19, 2022
Python
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Updated
May 5, 2021
JavaScript
Config driven, easy backup cli for restic.
Fast, accurate and scalable probabilistic data linkage using your choice of SQL backend
Updated
Dec 2, 2022
Python
A list of free data matching and record linkage software.
rustic - fast, encrypted, deduplicated backups powered by pure Rust
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
Updated
Jun 7, 2020
Python
Коллекция готовых SQL запросов для PostgreSQL по часто возникающим задачам (получение и модификация данных, ускорение запросов, обслуживание БД)
Updated
Dec 2, 2022
PLpgSQL
Improve this page
Add a description, image, and links to the
deduplication
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
deduplication
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.