The Wayback Machine - https://web.archive.org/web/20210308091952/https://github.com/topics/duplicates
Here are 148 public repositories matching this topic...
Copy/paste detector for programming source code.
Updated Mar 4, 2021 · TypeScript
Extremely fast tool to remove duplicates and other lint from your filesystem
Deduplication tool for yarn.lock files
Updated Jan 31, 2021 · JavaScript
Interesting (non-cryptographic) hashes implemented in pure Python.
Updated Mar 8, 2018 · Python
Quickly detect already witnessed data.
Loads async data for Redux apps focusing on preventing duplicated requests and dealing with async dependencies.
Updated Nov 14, 2017 · JavaScript
Plugin for Mongoose that turns duplicate errors into regular Mongoose validation errors
Updated Jan 18, 2019 · JavaScript
CLI utility to find duplicate files
A simple tool for detecting near-duplicate source code
Advanced similarity and duplicate source code proof of concept for our research efforts.
Updated Jun 3, 2019 · Python
Advanced Duplicate File Finder for Python
Updated Nov 23, 2020 · Python
Advanced similarity and duplicate source code at scale.
Updated Jun 6, 2019 · Scala
Vidupe is a program that can find duplicate and similar video files. V1.211 released on 2019-09-18, Windows exe here:
Tool for removing duplicate documents from Elasticsearch
Updated Feb 16, 2021 · Python
A configurable GitHub App which checks for potential issue duplicates using Damerau–Levenshtein distance algorithm.
Updated May 21, 2019 · JavaScript
GDuplicate Finder - A Groovy way to find duplicates among your computer and network shares!
Updated Jul 12, 2018 · Groovy
Find duplicate files on your computer
JS script that allows you to remove duplicates from your Last.fm scrobbles library.
Updated Feb 15, 2020 · JavaScript
Duplicate file finder written in Nim
⌚️ A faster unique() function
A CLI tool to find/remove duplicate files supporting multi-core and different algorithms (MD5, SHA256, and XXHash).
Magento 1.x module to target the URL Rewrite issue
Remove Duplicate Messages
Updated Nov 26, 2020 · JavaScript
📒 A simple app to optimize your address book and remove duplicate contacts.
Updated Jun 17, 2020 · Kotlin
Experimental JavaScript module to generate all possible variations of strings over an alphabet using an n-ary virtual tree
Updated Jan 4, 2018 · JavaScript
Updated Jun 1, 2020 · JavaScript
FS-Inspect is an easy-to-use tool designed to give you an overview of your files and directories (Disk Usage).
Updated Jan 25, 2017 · Ruby
A Go program to get duplicates from specified paths.
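Several of the tools listed here find duplicate files by hashing file contents (one entry names MD5, SHA256, and XXHash as its algorithms). As a rough illustration of that general idea, and not the implementation of any listed repository, here is a minimal Python sketch. It assumes files are first grouped by size and then by SHA-256 digest; the function names `file_digest` and `find_duplicates` are hypothetical and chosen only for this example.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Return groups (lists) of paths under root whose contents hash identically."""
    # Step 1: group by size, so files with a unique size are never hashed.
    by_size = defaultdict(list)
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            path = os.path.join(dirpath, name)
            # Skip symlinks and anything that is not a regular file.
            if os.path.isfile(path) and not os.path.islink(path):
                by_size[os.path.getsize(path)].append(path)

    # Step 2: within each size group, group by content digest.
    by_hash = defaultdict(list)
    for size, paths in by_size.items():
        if len(paths) < 2:
            continue
        for path in paths:
            by_hash[(size, file_digest(path))].append(path)

    return [sorted(group) for group in by_hash.values() if len(group) > 1]


if __name__ == "__main__":
    for group in find_duplicates("."):
        print(group)
```

Grouping by size first is only a cheap pre-filter; a matching digest is treated as a match here, whereas a more cautious tool might also compare the files byte by byte before deleting anything.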